D. W. TENQUIST, R. M. WHITTLE, J. YARWOOD

UNIVERSITY OPTICS

VOLUME 2




UNIVERSITY OPTICS, Volume 2 

Optics is an essential part of all physics 
courses. UNIVERSITY OPTICS, published 
in two volumes, presents a concise and 
logically developed account of both geo- 
metrical and physical optics. The book 
recognises the need for the student's 
understanding to grow in conjunction with 
his knowledge of mathematics and other 
branches of physics. 

Volume 1 devotes its first six chapters 
mainly to geometrical optics. Thereafter the 
wave theory of light is introduced, and the
principles of interference and of Fraunhofer 
diffraction are discussed. The final chapter 
of the volume is concerned with resolving 
power. 

This second volume thoroughly explains 
the electromagnetic and quantum theories 
of radiation and then deals with the polarisa- 
tion of light and an investigation of multiple- 
beam interferometry. Spectroscopy is 
studied next, and a more advanced treatment 
of diffraction and interference is given. 
Chapter 7 is devoted to colour and colour 
measurement. After a detailed examination 
of the microscope, this volume concludes 
with a chapter covering such topics as lasers 
and holography. 

The text is fully illustrated throughout by 
diagrams, and questions for the student are 
included at the end of each chapter. This 
carefully planned two-volume work covers
the optics required for Part 1 of an Honours 
Degree in Physics, for a General Degree in 
Physics, or for an Ancillary Physics course. 
Volume 1 is excellent for first year students; 
this second, more sophisticated, volume 
provides the ideal book for students of optics 
in subsequent years. 

382 pages, including 169 illustrations



£7 5( 



net 



D. W. TENQUIST was educated at Christ's 
Hospital, Horsham, Brighton College of 
Technology and Birkbeck College, University
of London. He graduated with an
Honours Degree in Physics in 1954 and
obtained a Master's Degree in Crystal- 
lography in 1962. For two years he worked as 
a Patent Examiner and since 1956 has been 
a Lecturer in Physics at the Polytechnic, 
Regent Street, where he specialises in 
optics and crystallography. 

R. M. WHITTLE joined the Polytechnic, 
Regent Street, at which he is a Senior 
Lecturer in Physics, in 1961. Prior to this, he 
was for ten years a Senior Scientific Officer 
at the John Innes Institute, Hertford, where 
he was concerned with the investigation of 
the natural environment of plants and in the 
control and measurement of the climate of 
biological controlled-environment rooms. 
He carried out an exhaustive survey of con- 
ditions under glass and has had several 
papers published in the Journal of Agricul- 
tural Engineering. Previously, he had worked 
on the efficiency of domestic heating and 
combustion processes. His main interest in
physics is the accuracy of experimental 
work, and he is particularly concerned that 
students should not lose much of the interest 
in science by failure to consider the statistics 
of observations. 

J. YARWOOD has been involved in tech- 
nical education since 1944, his main interest 
being in Degree courses in physics. He be- 
came Head of the Department of Mathe- 
matics and Physics at the Polytechnic, 
Regent Street, in 1953. He is the author of a 
two-volume work on Electricity, Magnetism 
and Atomic Physics for Honours Degree 
students, and is co-author of a book dealing 
with Electricity and Atomic Physics and 
intended for students on Advanced Level 
General Certificate of Education courses. 

During the War, he was a civilian instructor 
to the Royal Air Force; prior to this, he 
worked for six years at the Research Labora- 
tories of Electric and Musical Industries Ltd, 
and for two years at Adam Hilger Ltd (now 
Hilger and Watts Ltd). He is interested in 
research on vacuum physics and has 
written two books and twelve research 
papers on the subject. 



Jacket design by NT. P. Studio 






UNIVERSITY OPTICS 

Volume 2 



D. W. TENQUIST, M.SC.

R. M. WHITTLE, B.SC, PH.D. 

J. YARWOOD, M.SC, F.INST.P. 



LONDON ILIFFE BOOKS 



THE BUTTERWORTH GROUP 

Butterworth & Co (Publishers) Ltd 
London: 88 Kingsway WC2B 6AB 

AUSTRALIA 

Butterworth & Co (Australia) Ltd 
Sydney: 20 Loftus Street 
Melbourne: 343 Little Collins Street 
Brisbane: 240 Queen Street 

CANADA 

Butterworth & Co (Canada) Ltd 
Toronto: 14 Curity Avenue, 374 

NEW ZEALAND 

Butterworth & Co (New Zealand) Ltd 
Wellington: 49/51 Ballance Street 
Auckland: 35 High Street 

SOUTH AFRICA 

Butterworth & Co (South Africa) Ltd 
Durban : 33/35 Beach Grove 



First published in 1970 by 
Iliffe Books an imprint of 
the Butterworth Group 



© D. W. Tenquist, R. M. Whittle and J. Yarwood, 1970 
ISBN 592 05055 6
Printed in Hungary 






Contents 



PREFACE

chapter 1 Electromagnetic Theory of Radiation

chapter 2 Quantum Theory of Radiation

chapter 3 Polarisation of Light

chapter 4 Multiple-beam Interferometry

chapter 5 Spectroscopy

chapter 6 More Advanced Treatment of Diffraction and Interference

chapter 7 Colour and Colour Measurement

chapter 8 The Microscope

chapter 9 Coherence, Optical Masers and Holography

appendix

numerical values of constants

answers to numerical exercises

INDEX



Preface 



In conjunction with its companion, Volume 1, this text is intended 
to cover the requirements in the subject of optics for the student 
preparing for Part 1 of an Honours Degree in Physics, a General 
Degree in Science, or Ancillary Physics to an Honours Degree in 
Chemistry or other main discipline. It is also suitable for students 
reading for the Higher National Certificate or Diploma in Physics,
or the Graduateship Examination of The Institute of Physics and 
The Physical Society. 

In general, this second volume is intended for the second year of 
a two-year course in optics at undergraduate level. It is assumed that 
the reader is familiar with introductory vector calculus, although 
much of the text can be understood without such knowledge. 

The text departs to some extent from the conventional treatment 
of optics in undergraduate books in that a considerable emphasis 
is placed on quantum theory in order to provide a theoretical back- 
ground to the subsequent study of spectra and the optical maser. 

Grateful thanks are due for their assistance in the preparation 
of this book to Dr Peter Morse of the Physics staff at The Polytech- 
nic, Regent Street, to those University authorities who have kindly 
permitted the inclusion of exercises from their examination papers, 
and to Mrs Elizabeth Bangham who typed most of the text. 

LONDON D.W.T. 

1970 R.M.W. 

J. Y. 



CHAPTER 1 



Electromagnetic Theory 
of Radiation 



1.1 The Electromagnetic Field and Radiation 

The complete electromagnetic spectrum from wavelengths in free
space of $10^{-11}$ cm or less up to those of several metres comprises
γ-radiation, X-radiation, ultra-violet radiation, visible light, infra-red
radiation and radio waves. All these radiations essentially originate
from the movement of electric charge. For example, the emission
of visible light results from the change of energy of an atom
or molecule when electrons in the atoms or molecules of a substance
undergo a transition from a higher to a lower energy state consequent
upon the substance being excited, as in a discharge, or
molecules being set in vibration when the substance is heated.
Again, radio waves are produced by the repeated accelerations and
decelerations of electrons in a conductor (for example, the transmitter
aerial) when a varying current is passed through the conductor,
whilst X-rays result from the rapid deceleration of electrons
in motion when they impinge on a solid.

An electric field accompanies electric charge : a free electron in 
space is the seat of electrostatic field lines; charged conductors 
contain in the region between and around them an electrostatic 
field. If the electric charge is in motion relative to the observer 
or his measuring instruments, the electrostatic field in motion 
results in the detection of a magnetic field by the observer. For 
example, electrons travelling in a wire constitute an electric current 
which is inevitably attended by a magnetic field in the surrounding 
medium. 

Thus, electric charge, electrostatic field and magnetic field, as
phenomena, are interdependent: the presence of one implies that
of the other two. Electrostatic and magnetic fields are inevitably
both present: whether the electrostatic or the magnetic field
effect predominates, as one clearly does in many observations, is
a question of the magnitude of the relative speed and acceleration
between the electric charge and the observer.

The theory of this interdependence, leading to the electromagnet- 
ic theory of radiation, was first given in about 1865 by Clerk Max- 
well and forms one of the outstandingly original contributions in 
theoretical physics. 



1.2 The Electromagnetic Field: Maxwell's Equations 

The two concepts with which Maxwell was chiefly concerned and 
which were well established before his electromagnetic theory were 
both associated with magnetic fields in the vicinity of conductors. 
The first of these is Ampere's circuital law; the second is Neumann's 
equation which followed Faraday's laws of electromagnetic induc- 
tion (about 1831). Using consistent rationalised units, Ampere's 
circuital law may be expressed mathematically as 

$$\oint H \cos\theta \, dl = i \tag{1.1}$$

or, in vector notation,

$$\oint \mathbf{H} \cdot d\mathbf{l} = i \tag{1.2}$$

where $\mathbf{H}$ is the magnetic field vector of magnitude $H$, the field
strength, at a point on a closed path around the current $\mathbf{i}$ of magnitude
$i$, at which point there exists an element of path of length
$dl$, vector $d\mathbf{l}$, at an angle $\theta$ to the direction of $\mathbf{H}$ (Fig. 1.1).
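
Ampere's circuital law can be checked numerically. The sketch below is an illustration only, not part of the original text: it assumes the standard field of a long straight wire, azimuthal with magnitude $i/(2\pi r)$ in rationalised units, and sums $\mathbf{H} \cdot d\mathbf{l}$ around a square path enclosing the wire. The function name and the choice of a square path are hypothetical.

```python
import math

def line_integral_H_around_wire(current, half_side=1.0, steps_per_side=2000):
    """Numerically sum H . dl around a square path of half-width `half_side`
    centred on a long straight wire lying along the z-axis.

    Assumption (not from the text): H is azimuthal with magnitude
    current / (2 * pi * r), the usual result for a long straight wire.
    """
    # Corners of the square, traversed anticlockwise when viewed from +z.
    corners = [(half_side, -half_side), (half_side, half_side),
               (-half_side, half_side), (-half_side, -half_side),
               (half_side, -half_side)]
    total = 0.0
    for (x0, y0), (x1, y1) in zip(corners[:-1], corners[1:]):
        dx = (x1 - x0) / steps_per_side
        dy = (y1 - y0) / steps_per_side
        for k in range(steps_per_side):
            # Midpoint of the k-th path element on this side.
            x = x0 + (k + 0.5) * dx
            y = y0 + (k + 0.5) * dy
            r2 = x * x + y * y
            # Cartesian components of the azimuthal field.
            hx = -current * y / (2.0 * math.pi * r2)
            hy = current * x / (2.0 * math.pi * r2)
            total += hx * dx + hy * dy
    return total

if __name__ == "__main__":
    # The sum should be close to the enclosed current, here 2.5.
    print(line_integral_H_around_wire(2.5))
```

The result does not depend on the size of the square, which is the content of Equation 1.2: only the enclosed current matters.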

Maxwell's generalisation of this result was that the concept still 
applied even if no actual conduction current i existed, and that a time 
variation of an electrostatic field E in free space or a dielectric 
was also attended by a magnetic field. He introduced the concept 
of displacement current and showed that, if consistent rationalised 
units are employed, 

$$\mathbf{j}_D = \frac{\partial \mathbf{D}}{\partial t} \tag{1.3}$$

where the vector $\mathbf{j}_D$ of magnitude $j_D$ is the displacement current
density at a time $t$ due to the time variation of the electric displacement
$\mathbf{D}$ of magnitude $D$; as is well known,

$$\mathbf{D} = \varepsilon \mathbf{E} \tag{1.4}$$

$\mathbf{E}$ being the electric field strength, and $\varepsilon$ the permittivity of the
medium.



Fig. 1.1. Ampere's circuital law (labels: conductor; any closed path around
the current; direction of dl; direction of the magnetic field H at dl)

In his generalisation of Neumann's equation

$$e = -\frac{\partial \Phi}{\partial t} \tag{1.5}$$

where $e$ is the e.m.f. around a circuit through which the magnetic
flux is $\Phi$ at time $t$, Maxwell again argued that an e.m.f. would be
set up even if no actual conductor were present; this is tantamount 
to the concept that a time variation of magnetic flux in free space 
or a dielectric is accompanied by an electrostatic field, the e.m.f. 
over any element of path in this field being the electrostatic field 
strength multiplied by the path length. 

The consequences of these two generalisations, when expressed 
in the form of the appropriate differential equations, led to the 
famous Maxwell equations for the electromagnetic field.* Relative 
to a right-handed set of Cartesian axes, these equations are: 



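$$\frac{\partial H_z}{\partial y} - \frac{\partial H_y}{\partial z} = j_x + \frac{\partial D_x}{\partial t} \tag{1.6a}$$

$$\frac{\partial H_x}{\partial z} - \frac{\partial H_z}{\partial x} = j_y + \frac{\partial D_y}{\partial t} \tag{1.6b}$$

$$\frac{\partial H_y}{\partial x} - \frac{\partial H_x}{\partial y} = j_z + \frac{\partial D_z}{\partial t} \tag{1.6c}$$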



* For proofs of these equations see, for example, J. H. Fewkes and 
J. Yarwood, Electricity and Magnetism, University Tutorial Press, Cambridge, 
Vol. 1, 1965. 



These equations arise from the generalisation of Ampere's circuital
law, where $\mathbf{H}$ is magnetic field strength, $\mathbf{D}$ is electric displacement
and $\mathbf{j}$ is conduction current density; the suffixes $x$, $y$ and $z$ denote
components along the $x$, $y$ and $z$ directions respectively, and it is
assumed that both conduction currents and displacement currents
exist at the point $(x, y, z)$ in the region concerned. The equations

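$$\frac{\partial E_z}{\partial y} - \frac{\partial E_y}{\partial z} = -\frac{\partial B_x}{\partial t} \tag{1.7a}$$

$$\frac{\partial E_x}{\partial z} - \frac{\partial E_z}{\partial x} = -\frac{\partial B_y}{\partial t} \tag{1.7b}$$

$$\frac{\partial E_y}{\partial x} - \frac{\partial E_x}{\partial y} = -\frac{\partial B_z}{\partial t} \tag{1.7c}$$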



where E denotes electric field strength and B magnetic flux density, 
also arise from the generalisation of Neumann's equation. 

In vector calculus form, these Equations 1.6 and 1.7 become, 
respectively, 

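$$\operatorname{curl} \mathbf{H} = \mathbf{j} + \frac{\partial \mathbf{D}}{\partial t} \tag{1.8}$$

and

$$\operatorname{curl} \mathbf{E} = -\frac{\partial \mathbf{B}}{\partial t} \tag{1.9}$$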



These, of course, apply in any system of coordinates. 

To these equations must be added the conditions arising from 
Poisson's analytical form of Gauss's theorem in electrostatics and 
in magnetism, which may be expressed as 

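$$\frac{\partial D_x}{\partial x} + \frac{\partial D_y}{\partial y} + \frac{\partial D_z}{\partial z} = \rho$$

or, in vector form,

$$\operatorname{div} \mathbf{D} = \rho \tag{1.10}$$

where $\rho$ is the charge per unit volume present in the region; also

$$\frac{\partial B_x}{\partial x} + \frac{\partial B_y}{\partial y} + \frac{\partial B_z}{\partial z} = 0$$

or

$$\operatorname{div} \mathbf{B} = 0 \tag{1.11}$$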

In the majority of cases of interest in optics, the medium is not
electrically conducting, so conduction current does not exist; the
medium is also isotropic and uniform, so the electric displacement
$\mathbf{D}$ is always in the same direction at any point as the electric field
strength vector $\mathbf{E}$, and the magnetic flux density vector $\mathbf{B}$ is always
in the same direction as the magnetic field strength vector $\mathbf{H}$.
Furthermore, the permittivity $\varepsilon$ and the magnetic permeability
$\mu$ are constant. Maxwell's equations (Equations 1.8 and 1.9) then
become, respectively,

$$\operatorname{curl} \mathbf{H} = \varepsilon \frac{\partial \mathbf{E}}{\partial t} \tag{1.12}$$

and

$$\operatorname{curl} \mathbf{E} = -\mu \frac{\partial \mathbf{H}}{\partial t} \tag{1.13}$$



1.3 Electromagnetic Waves in a Uniform Isotropic Dielectric

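Taking the curl of curl $\mathbf{E}$ in Equation 1.13 gives

$$\operatorname{curl} \operatorname{curl} \mathbf{E} = -\mu \operatorname{curl} \frac{\partial \mathbf{H}}{\partial t} = -\mu \frac{\partial}{\partial t} (\operatorname{curl} \mathbf{H}) \tag{1.14}$$

A standard result from vector calculus for any vector such as $\mathbf{E}$ is

$$\operatorname{curl} \operatorname{curl} \mathbf{E} = \operatorname{grad} \operatorname{div} \mathbf{E} - \nabla^2 \mathbf{E} \tag{1.15}$$

Equation 1.10 shows that $\operatorname{div} \mathbf{D} = \operatorname{div} (\varepsilon \mathbf{E}) = 0$ if no space charge
is present, so $\operatorname{div} \mathbf{E} = 0$ and

$$\operatorname{curl} \operatorname{curl} \mathbf{E} = -\nabla^2 \mathbf{E}$$

Equation 1.14 therefore becomes

$$\nabla^2 \mathbf{E} = \mu \frac{\partial}{\partial t} (\operatorname{curl} \mathbf{H}) \tag{1.16}$$

Differentiation with respect to time of Equation 1.12 gives

$$\frac{\partial}{\partial t} (\operatorname{curl} \mathbf{H}) = \varepsilon \frac{\partial^2 \mathbf{E}}{\partial t^2}$$

Substituting this result in Equation 1.16, it follows that

$$\nabla^2 \mathbf{E} = \mu \varepsilon \frac{\partial^2 \mathbf{E}}{\partial t^2} \tag{1.17}$$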



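where, in Cartesian coordinates,

$$\nabla^2 \mathbf{E} = \frac{\partial^2 \mathbf{E}}{\partial x^2} + \frac{\partial^2 \mathbf{E}}{\partial y^2} + \frac{\partial^2 \mathbf{E}}{\partial z^2}$$

On precisely similar lines, it is established that

$$\nabla^2 \mathbf{H} = \mu \varepsilon \frac{\partial^2 \mathbf{H}}{\partial t^2} \tag{1.18}$$

Equations 1.17 and 1.18 are the general differential equations
of wave motion in three dimensions. They show that the variations
of the electric vector $\mathbf{E}$ and of the magnetic vector $\mathbf{H}$ are propagated
with a wave velocity of $v$, where

$$\nabla^2 \mathbf{E} = \frac{1}{v^2} \frac{\partial^2 \mathbf{E}}{\partial t^2} \tag{1.19}$$

and likewise for $\nabla^2 \mathbf{H}$; the speed of propagation is given by

$$v = \frac{1}{\sqrt{\mu \varepsilon}} \tag{1.20}$$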
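
The link between the wave equation and the speed of propagation can be illustrated numerically. The following is a minimal sketch, not taken from the text: it treats one field component as a profile moving in the x-direction, $u(x, t) = f(x - vt)$, and estimates the second derivatives by central finite differences. The function name, the step size and the sinusoidal profile are assumptions made for illustration.

```python
import math

def wave_equation_residual(profile, x, t, v_profile, v_equation, h=1e-3):
    """Estimate d2u/dx2 - (1 / v_equation**2) * d2u/dt2 at (x, t)
    for u(x, t) = profile(x - v_profile * t), using central differences."""
    def u(xx, tt):
        return profile(xx - v_profile * tt)

    d2u_dx2 = (u(x + h, t) - 2.0 * u(x, t) + u(x - h, t)) / h**2
    d2u_dt2 = (u(x, t + h) - 2.0 * u(x, t) + u(x, t - h)) / h**2
    return d2u_dx2 - d2u_dt2 / v_equation**2

# A sinusoidal profile moving at speed 2 (arbitrary units).
matched = wave_equation_residual(math.sin, x=0.7, t=0.3,
                                 v_profile=2.0, v_equation=2.0)
mismatched = wave_equation_residual(math.sin, x=0.7, t=0.3,
                                    v_profile=2.0, v_equation=3.0)
print(matched, mismatched)
```

The residual is close to zero only when the speed in the equation matches the speed of the profile, which is why the coefficient $\mu \varepsilon$ in Equation 1.17 can be read as $1/v^2$.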

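Where the medium is free space, $\mu = \mu_0$ and $\varepsilon = \varepsilon_0$. Using rationalised
m.k.s. units,

$$\mu_0 = 4\pi \times 10^{-7}\ \mathrm{H\,m^{-1}}$$

and

$$\varepsilon_0 = \frac{1}{4\pi} \times 1.11 \times 10^{-10}\ \mathrm{F\,m^{-1}}$$

The speed of light in free space is therefore predicted by electromagnetic
theory to be

$$c = \frac{1}{\sqrt{1.11 \times 10^{-17}}}\ \mathrm{m\,s^{-1}} = \frac{10^{9}}{\sqrt{11.1}}\ \mathrm{m\,s^{-1}} = 3 \times 10^{8}\ \mathrm{m\,s^{-1}}$$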

This value agrees with experimental observation (see Chapter 5 
of Volume 1), and thus represents triumphant verification of the 
electromagnetic theory of radiation. 
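
The arithmetic can be reproduced in a few lines. This is a sketch using the rounded values of the free-space permeability and permittivity quoted in the text; the names `MU_0`, `EPS_0` and `wave_speed` are chosen here for illustration.

```python
import math

MU_0 = 4.0 * math.pi * 1e-7          # permeability of free space, H/m
EPS_0 = 1.11e-10 / (4.0 * math.pi)   # permittivity of free space, F/m (rounded value from the text)

def wave_speed(mu, eps):
    """Equation 1.20: speed of propagation v = 1 / sqrt(mu * eps)."""
    return 1.0 / math.sqrt(mu * eps)

c = wave_speed(MU_0, EPS_0)
print(f"c = {c:.2e} m/s")
```

Because the permittivity is rounded to three figures, the result agrees with $3 \times 10^8$ m s$^{-1}$ only to about that precision.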

Note that this calculation is correct dimensionally. Thus, the
dimensions of inductance (unit, the henry) in the m.k.s. system of
units are $[\mathrm{I^{-2} M L^{2} T^{-2}}]$ and of capacitance (unit, the farad) are
$[\mathrm{I^{2} M^{-1} L^{-2} T^{4}}]$. Consequently, the dimensions of inductance times
capacitance are $[\mathrm{T^{2}}]$. Since $\mu_0$ is measured in henry per metre and $\varepsilon_0$
in farad per metre, the product $\mu_0 \varepsilon_0$ has dimensions $[\mathrm{T^{2} L^{-2}}]$.
So $1/\sqrt{\mu_0 \varepsilon_0}$ is in the units metre per second.

For almost all the transparent dielectrics encountered as optical
media for use in the visible spectrum, the magnetic permeability is
very nearly equal to $\mu_0$, that for free space. It follows from Equation
1.20, therefore, that the speed of light $v$ in a medium of permittivity
$\varepsilon$ should be related to the speed $c$ in free space by

$$v = \frac{c}{\sqrt{\varepsilon_r}} \tag{1.21}$$

where $\varepsilon_r$ is the relative permittivity of the medium, i.e. $\varepsilon = \varepsilon_r \varepsilon_0$.


1.4 Plane Electromagnetic Waves 

Referring to a right-handed set of Cartesian axes (Fig. 1.2) suppose 
the wavefront at any instant of time is in a plane parallel to the 
plane yOz. The electric vector E and the magnetic vector H will have 
components in such a wavefront which do not vary with y and z 
because they are constant at the given instant of time. Therefore 
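$\partial E_y/\partial y$, $\partial E_z/\partial z$, $\partial H_y/\partial y$ and $\partial H_z/\partial z$ are all zero. It follows from
Equations 1.10 and 1.11, when no space charge is present, that

$$\frac{\partial E_x}{\partial x} = 0 \tag{1.22}$$

and

$$\frac{\partial H_x}{\partial x} = 0 \tag{1.23}$$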



Fig. 1.2. Plane wavefront, parallel to the plane yOz, referred to right-handed Cartesian axes



Thus, $E_x$ and $H_x$ do not vary with $x$, so there are no spatial variations
of the components of these fields in the direction of $x$.

From Equation 1.6a, when no conduction current is present,
$j_x = 0$ and it follows that

$$\frac{\partial D_x}{\partial t} = \varepsilon \frac{\partial E_x}{\partial t} = 0$$

and, from Equation 1.7a, it follows that

$$\frac{\partial B_x}{\partial t} = \mu \frac{\partial H_x}{\partial t} = 0$$

Consequently, time variations of the x-components of $\mathbf{E}$ and $\mathbf{H}$ are
also zero. Admittedly, constant electric and magnetic fields may
exist with components in the x-direction but only field variations
are of any significance in wave propagation.
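The wave equation for the electric field (Equation 1.17) is

$$\frac{\partial^2 \mathbf{E}}{\partial x^2} + \frac{\partial^2 \mathbf{E}}{\partial y^2} + \frac{\partial^2 \mathbf{E}}{\partial z^2} = \mu \varepsilon \frac{\partial^2 \mathbf{E}}{\partial t^2}$$

Since $E_y$ and $E_z$ do not vary with $y$ and $z$ respectively, and $E_x$ does
not vary with $x$, $y$ or $z$, this expression for $\nabla^2 \mathbf{E}$ reduces to

$$\frac{\partial^2 E_y}{\partial x^2} + \frac{\partial^2 E_z}{\partial x^2} = \mu \varepsilon \frac{\partial^2 \mathbf{E}}{\partial t^2}$$

As $\partial E_x/\partial t = 0$, it follows that

$$\frac{\partial^2 E_y}{\partial x^2} = \mu \varepsilon \frac{\partial^2 E_y}{\partial t^2} \tag{1.24}$$

and

$$\frac{\partial^2 E_z}{\partial x^2} = \mu \varepsilon \frac{\partial^2 E_z}{\partial t^2} \tag{1.25}$$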

From Equations 1.24 and 1.25 it is clear that the wave motion is 
propagated along the direction of the x-axis. Combined with the 
fact already established that no time variation of E x occurs, it 
follows that the electric vector in the wavefront is perpendicular to 
the direction of propagation. The same analysis applies to the mag- 
netic vector. Thus, the wave motion is transverse, with the electric 
and magnetic vectors both in a plane perpendicular to the direction 
of propagation. 

If the wave be propagated in the positive direction of $x$ with
a wave velocity $v = 1/\sqrt{\mu \varepsilon}$, the $y$ and $z$ components of the electric
and magnetic fields respectively may be expressed by the following
equations:

$$E_y = f_1(x - vt) \tag{1.26}$$

$$E_z = f_2(x - vt) \tag{1.27}$$

$$H_y = g_1(x - vt) \tag{1.28}$$

$$H_z = g_2(x - vt) \tag{1.29}$$

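where $f_1$, $f_2$, $g_1$ and $g_2$ are periodic functions. Now from Equation
1.6b,

$$-\frac{\partial H_z}{\partial x} = \varepsilon \frac{\partial E_y}{\partial t}$$

Substituting for $H_z$ from Equation 1.29 and for $E_y$ from Equation
1.26 gives

$$\frac{\partial}{\partial x} [g_2(x - vt)] = -\varepsilon \frac{\partial}{\partial t} [f_1(x - vt)] = \varepsilon v f_1'(x - vt)$$

where $f_1'$ is the first derivative of $f_1(x - vt)$. But, from Equation
1.20, $v = 1/\sqrt{\mu \varepsilon}$. Therefore

$$g_2'(x - vt) = \sqrt{\frac{\varepsilon}{\mu}}\, f_1'(x - vt)$$

Integrating,

$$g_2(x - vt) = \sqrt{\frac{\varepsilon}{\mu}}\, f_1(x - vt) \tag{1.30}$$

where a constant of integration is not involved because constant
fields are not of concern. Likewise, employing Equation 1.6c,

$$g_1(x - vt) = -\sqrt{\frac{\varepsilon}{\mu}}\, f_2(x - vt) \tag{1.31}$$

Hence, Equations 1.26, 1.27, 1.28 and 1.29 can be written:

$$E_y = f_1(x - vt)$$

$$E_z = f_2(x - vt)$$

$$H_y = -\sqrt{\frac{\varepsilon}{\mu}}\, f_2(x - vt)$$

$$H_z = \sqrt{\frac{\varepsilon}{\mu}}\, f_1(x - vt)$$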



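The resultant electric and magnetic fields are $\mathbf{E}$ and $\mathbf{H}$ respectively,
of magnitudes

$$E = \sqrt{E_y^2 + E_z^2} = \sqrt{[f_1(x - vt)]^2 + [f_2(x - vt)]^2}$$

and

$$H = \sqrt{H_y^2 + H_z^2} = \sqrt{\frac{\varepsilon}{\mu}} \sqrt{[f_2(x - vt)]^2 + [f_1(x - vt)]^2}$$

Therefore

$$H = \sqrt{\frac{\varepsilon}{\mu}}\, E \tag{1.32}$$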



Furthermore, the electric field vector $\mathbf{E}$ is at an angle $\theta_1$ to the y-axis
given by

$$\tan \theta_1 = \frac{E_z}{E_y}$$

and the magnetic field is at an angle $\theta_2$ to the y-axis given by

$$\tan \theta_2 = \frac{H_z}{H_y}$$

But

$$\frac{E_z}{E_y} = \frac{f_2(x - vt)}{f_1(x - vt)}$$

and

$$\frac{H_z}{H_y} = -\frac{f_1(x - vt)}{f_2(x - vt)}$$

Therefore

$$\tan \theta_1 = -\cot \theta_2$$

and

$$|\theta_1 - \theta_2| = 90°$$

It follows that the electric field vector $\mathbf{E}$ is always at right angles to
the magnetic vector $\mathbf{H}$ in a plane wavefront propagated in a dielectric.

To summarise, therefore, an electromagnetic vibration propaga- 
ted with a plane wavefront is such that the electric and magnetic 
vectors are perpendicular to one another in this plane wavefront 
and this plane is perpendicular to the direction of propagation. 
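
These properties can be confirmed for arbitrary waveforms. The sketch below is illustrative only: it builds the field components from Equations 1.26, 1.27, 1.30 and 1.31 for two chosen profile functions and checks that the scalar product of the two field vectors vanishes and that their magnitudes are in the ratio given by Equation 1.32. The function names, the sample profiles and the numerical values of the permittivity and permeability (arbitrary units) are assumptions.

```python
import math

def plane_wave_fields(f1, f2, x, t, eps, mu):
    """Field components of a plane wave travelling along +x.

    Returns (E, H) as (x, y, z) tuples, following Equations 1.26, 1.27,
    1.30 and 1.31; the x-components carry no wave and are set to zero.
    """
    v = 1.0 / math.sqrt(mu * eps)     # Equation 1.20
    s = x - v * t                     # argument of the profile functions
    ratio = math.sqrt(eps / mu)
    e_field = (0.0, f1(s), f2(s))
    h_field = (0.0, -ratio * f2(s), ratio * f1(s))
    return e_field, h_field

def dot(a, b):
    """Scalar product of two 3-component tuples."""
    return sum(p * q for p, q in zip(a, b))

# Example with two different profiles and eps = 4, mu = 1 (arbitrary units).
E, H = plane_wave_fields(math.sin, math.cos, x=0.4, t=0.1, eps=4.0, mu=1.0)
print(dot(E, H))                                        # close to zero: E is perpendicular to H
print(math.sqrt(dot(H, H)) / math.sqrt(dot(E, E)))      # close to sqrt(eps / mu)
```

The orthogonality holds whatever profiles are supplied, since the two products in the scalar product cancel term by term.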



1.5 Velocity of Electromagnetic Radiation and Refractive Index 

Equation 1.21 relates the wave velocity in a medium of relative
permittivity $\varepsilon_r$ to the velocity in free space:

$$v = \frac{c}{\sqrt{\varepsilon_r}}$$

The refractive index $n$ of a medium is defined by the following
relationship:

$$n = \frac{\text{velocity of electromagnetic radiation in free space}}{\text{velocity of radiation in the medium}}$$

Thus

$$n = \frac{c}{v}$$

or

$$n = \sqrt{\varepsilon_r} \tag{1.33}$$

i.e. the refractive index should equal the square root of the relative 
permittivity for a dielectric for which the relative permeability is 
unity. 

This relationship is obeyed satisfactorily for gases, but does not
at first appear to be correct for liquids and solids. For example, for
distilled water $n = 1.33$ whereas $\varepsilon_r = 81$ and $\sqrt{\varepsilon_r} = 9$. However,
this large discrepancy arises because $\varepsilon_r = 81$ only if the measurement of
relative permittivity is carried out under d.c. or low-frequency a.c.
conditions. The high value obtained is due to orientation of the
dipole water molecules in the electric field. At very high frequencies,
this orientation is not effective. Presuming, therefore, that the
refractive index $n$ and the relative permittivity $\varepsilon_r$ were both
measured at optical frequencies, Equation 1.33 would be correct for
water and other liquids and solids.
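
The size of the discrepancy for water is easy to see numerically. The following sketch applies Equation 1.33 in both directions to the two figures quoted above; the function names are chosen here for illustration.

```python
import math

def refractive_index_from_permittivity(eps_r):
    """Equation 1.33 for a non-magnetic dielectric: n = sqrt(eps_r)."""
    return math.sqrt(eps_r)

def permittivity_from_refractive_index(n):
    """Equation 1.33 inverted: eps_r = n**2."""
    return n * n

# Low-frequency relative permittivity of water quoted in the text.
n_static = refractive_index_from_permittivity(81.0)
# Relative permittivity implied by the optical refractive index of water.
eps_optical = permittivity_from_refractive_index(1.33)
print(n_static, round(eps_optical, 2))  # prints: 9.0 1.77
```

On this reading, Equation 1.33 would require a relative permittivity of roughly 1.77 at optical frequencies, far below the low-frequency value of 81.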



1.6 The Reflection and Refraction of Linearly Polarised Monochromatic
Light at the Boundary between Two Transparent Isotropic
Dielectric Media

At any boundary between two media 1 and 2, it can be readily
shown by electrical theory that the vector $\mathbf{E}$ denoting the electric
field strength is such that its components tangential to the boundary
are continuous. This means that the component of $\mathbf{E}$ tangential to
the boundary in medium 1 is equal to that in medium 2. If the
boundary is the xOz plane in Cartesian coordinates,

$$E'_x = E''_x \tag{1.34}$$

and

$$E'_z = E''_z \tag{1.35}$$

where $E'_x$ is the resultant component of the electric vector $\mathbf{E}$ in the
x-direction in medium 1, $E''_x$ is the resultant component of $\mathbf{E}$ in the
x-direction in medium 2, and the $z$ suffixes are for the z-direction.
Furthermore, the boundary conditions are such that the same
result applies for the magnetic vector $\mathbf{H}$, so

$$H'_x = H''_x \tag{1.36}$$

and

$$H'_z = H''_z \tag{1.37}$$

However, for the electric displacement vector $\mathbf{D}$ and the magnetic
flux vector $\mathbf{B}$, it is the components normal to the boundary that are
continuous. Hence, with similar notation,

$$D'_y = D''_y \tag{1.38}$$

and

$$B'_y = B''_y \tag{1.39}$$

For optical media where the magnetic permeability is equal to
that of free space ($\mu_0$) and where the permittivity $\varepsilon$ is constant
throughout for a given isotropic dielectric material, $\mathbf{B} = \mu_0 \mathbf{H}$
and $\mathbf{D} = \varepsilon \mathbf{E}$; thus Equations 1.38 and 1.39 become

$$\varepsilon_1 E'_y = \varepsilon_2 E''_y \tag{1.40}$$

where $\varepsilon_1$ is the permittivity of medium 1 and $\varepsilon_2$ that of medium 2;
also

$$H'_y = H''_y \tag{1.41}$$

Consider a simple harmonic wave of electromagnetic radiation in
a collimated beam along AO in medium 1 of permittivity $\varepsilon_1$ and
incident at an angle $I$ on the boundary in the xOz plane at which
$y = 0$; this beam will give rise to a reflected beam along OB at an
angle $I'$ to the normal (specular reflection will occur if the boundary
is smooth, as when light in air impinges upon polished glass) and
a refracted beam along OC at an angle $R$ to the normal in the
medium 2 of permittivity $\varepsilon_2$ (Fig. 1.3).

Fig. 1.3. Reflection and refraction of light at the boundary between two
transparent isotropic dielectric media

It is assumed that the incident monochromatic radiation travelling
along AO is in the plane xOy and is linearly polarised in a plane
of which the direction is not specified. Conventionally, the direction
of linear polarisation is the direction of the electric displacement
vector in the wave. The equation for this propagated wave is

$$\mathbf{E} = \mathbf{E}_0 \cos 2\pi \left( \frac{t}{T_1} - \frac{d}{\lambda_1} \right)$$

where $\mathbf{E}$ is the electric field vector at time $t$ at a point P distant
$d$ from the origin at O, $\mathbf{E}_0$ is the peak value of $\mathbf{E}$, $\lambda_1$ is the wavelength
of the radiation in medium 1, and $T_1$ is the corresponding period.
If $(x, y, z)$ are the coordinates of P,

$$d = l_1 x + m_1 y + n_1 z$$

where $l_1$, $m_1$ and $n_1$ are the direction-cosines of OA with respect to
the $x$, $y$ and $z$ axes respectively. Therefore

$$\mathbf{E} = \operatorname{Re} \{ \mathbf{E}_0 \exp j [\omega_1 t - \alpha_1 (l_1 x + m_1 y + n_1 z)] \} \tag{1.42}$$

where $\omega_1 = 2\pi/T_1 = 2\pi\nu_1$, $\nu_1$ being the frequency, and $\alpha_1 = 2\pi/\lambda_1$.
For the reflected beam, let the electric field vector be $\mathbf{E}'$ and the
direction-cosines be $l_2$, $m_2$ and $n_2$, whilst for the refracted beam the
corresponding symbols are $\mathbf{E}''$, $l_3$, $m_3$ and $n_3$. Denoting the z-components
of $\mathbf{E}$ and $\mathbf{E}_0$ by $E_z$ and $E_{0z}$, and correspondingly for $\mathbf{E}'$, $\mathbf{E}'_0$,
$\mathbf{E}''$ and $\mathbf{E}''_0$, the following equations arise from Equation 1.42.



For the incident beam,

$$E_z = E_{0z} \exp j [\omega_1 t - \alpha_1 (l_1 x + m_1 y + n_1 z)] \tag{1.43}$$

For the reflected beam,

$$E'_z = E'_{0z} \exp j [\omega_2 t - \alpha_2 (l_2 x + m_2 y + n_2 z)] \tag{1.44}$$

For the refracted beam,

$$E''_z = E''_{0z} \exp j [\omega_3 t - \alpha_3 (l_3 x + m_3 y + n_3 z)] \tag{1.45}$$

where $\omega_2 = 2\pi\nu_2$ and $\omega_3 = 2\pi\nu_3$, $\nu_2$ and $\nu_3$ being the frequencies of
the vibrations in the reflected and refracted beams respectively;
$\alpha_2 = 2\pi/\lambda_2$ and $\alpha_3 = 2\pi/\lambda_3$, where $\lambda_2$ and $\lambda_3$ are the wavelengths of
the radiation in the reflected and refracted beams respectively.

It is clear from Fig. 1.3 that the resultant component of $\mathbf{E}$ in medium 1
is now the sum of the two components $E_z$ and $E'_z$ (because
both the incident and reflected waves are in medium 1), whereas
the resultant component of $\mathbf{E}$ in medium 2 is $E''_z$ because only the
refracted wave exists in medium 2. Therefore

$$E_z + E'_z = E''_z \tag{1.46}$$

at the boundary plane xOz where $y = 0$. This can only be true if
Equation 1.46 is satisfied at the boundary for all values of $x$, $z$ and
$t$ (i.e. if the coefficients of $x$ in Equations 1.43, 1.44 and 1.45 are equal)
and similarly for the coefficients of $z$ and $t$. Equating the coefficients
of $t$,

$$\omega_1 = \omega_2 = \omega_3$$

or

$$\nu_1 = \nu_2 = \nu_3 = \nu \text{ (say)}$$

The vibrations in the incident, reflected and refracted waves are
therefore all of the same frequency; reflection and refraction do
not cause any change of the frequency of the incident radiation.

Because the incident and reflected beams are in the same medium,
they must have the same velocity (Equation 1.20). As their
frequencies are the same, they therefore have the same wavelength.
Hence $\alpha_1 = \alpha_2$. Equating coefficients of $z$ in Equations 1.43, 1.44
and 1.45 and putting $\alpha_1 = \alpha_2 = \alpha$,

$$\alpha n_1 = \alpha n_2 = \alpha_3 n_3$$

But $n_1 = 0$ because the incident beam is perpendicular to the z-axis.
Therefore

$$n_1 = n_2 = n_3 = 0$$



It follows that all three beams, incident, reflected and refracted,
must be in the same plane, the plane of incidence xOy, which contains
the normal. This is the first law of reflection and also of refraction
in optics.

Equating coefficients of $x$,

$$\alpha l_1 = \alpha l_2 = \alpha_3 l_3$$

As $l_1 = \sin I$ and $l_2 = \sin I'$,

$$I = I'$$

which is the second law of reflection: the angle of incidence equals
the angle of reflection.

Further, $l_3 = \sin R$. Therefore

$$\alpha \sin I = \alpha_3 \sin R$$

or

$$\frac{\sin I}{\sin R} = \frac{\alpha_3}{\alpha} = \frac{2\pi}{\lambda_3} \Big/ \frac{2\pi}{\lambda_1} = \frac{\lambda_1}{\lambda_3}$$

Now $\lambda_1 = v_1/\nu$ and $\lambda_3 = v_2/\nu$, where $v_1$ and $v_2$ are the wave
velocities in media 1 and 2 respectively, the frequencies being the
same. Therefore

$$\frac{\sin I}{\sin R} = \frac{v_1}{v_2}$$

Thus, from Equations 1.20 and 1.33,

$$\frac{\sin I}{\sin R} = n$$

where $n$ is the refractive index of medium 2 relative to that of medium 1.
This is the second law of refraction, known as Snell's law.
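
As a short worked illustration of Snell's law, the sketch below computes the angle of refraction from the angle of incidence and the relative refractive index. The function name is hypothetical, and the guard against a sine greater than unity is an addition for robustness rather than something discussed in this section.

```python
import math

def refraction_angle(incidence_deg, n):
    """Angle of refraction R in degrees from sin I / sin R = n, where n is
    the refractive index of medium 2 relative to that of medium 1."""
    sin_r = math.sin(math.radians(incidence_deg)) / n
    if abs(sin_r) > 1.0:
        # No real angle R satisfies the relation for this I and n.
        raise ValueError("no refracted beam: sin R would exceed 1")
    return math.degrees(math.asin(sin_r))

# Light incident at 30 degrees on a medium with n = 1.5.
print(refraction_angle(30.0, 1.5))
```

For $I = 30°$ and $n = 1.5$, $\sin R = 1/3$, so $R$ is a little under $19.5°$.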



1.7 The Reflection and Refraction of Light at a Boundary: Fresnel's
Equations

Now that it has been shown that the electromagnetic theory is 
consistent with the well-known laws of reflection and refraction, 
a more important development from the concepts of Section 1.6 is 
to consider the state of polarisation of the reflected and refracted 
light in relation to that of the incident light. 



The direction of the electric vector $\mathbf{E}$ in the light waves can be
anywhere at a particular time $t$. Whatever this direction, it can be
resolved into $E_l$ parallel to the plane of incidence, which also contains
the reflected and refracted rays, as shown, and $E_r$ perpendicular
to this plane. The r-direction is the same as the z-direction in Fig.
1.3. Therefore $E_r$ is considered positive if it lies in the positive direction
of $z$ and negative if in the opposite direction. To specify the
positive direction of $E_l$, let it be such that the vector $E_l$ in the plane
of incidence is to the right-hand side of the direction of propagation;
negative $E_l$ then corresponds to the left-hand side (Fig. 1.4).

Fig. 1.4. Reflection and refraction at a boundary with the positive directions
of the components $E_l$ shown



Equations 1.43, 1.44 and 1.45 can be written for the incident beam
with electric vector components $E_{1l}$ and $E_{1r}$, for the reflected beam
with components $E_{2l}$ and $E_{2r}$, and for the refracted beam with components
$E_{3l}$ and $E_{3r}$. These equations can be simplified because it has
been shown that $\omega_1 = \omega_2 = \omega_3 = \omega$, $\alpha_1 = \alpha_2 = \alpha$, $n_1 = n_2 = n_3 = 0$
and $l_1 = l_2$. Hence, for the incident beam:

$$E_{1l} = E_{10l} \exp j [\omega t - \alpha (l_1 x + m_1 y)] \tag{1.47a}$$

$$E_{1r} = E_{10r} \exp j [\omega t - \alpha (l_1 x + m_1 y)] \tag{1.47b}$$

For the reflected beam:

$$E_{2l} = E_{20l} \exp j [\omega t - \alpha (l_1 x + m_2 y)] \tag{1.48a}$$

$$E_{2r} = E_{20r} \exp j [\omega t - \alpha (l_1 x + m_2 y)] \tag{1.48b}$$

For the refracted beam:

$$E_{3l} = E_{30l} \exp j [\omega t - \alpha_3 (l_3 x + m_3 y)] \tag{1.49a}$$

$$E_{3r} = E_{30r} \exp j [\omega t - \alpha_3 (l_3 x + m_3 y)] \tag{1.49b}$$

From the boundary condition represented by Equations 1.35 and
1.46 at $y = 0$,

$$E_{1r} + E_{2r} = E_{3r} \tag{1.50}$$

Applying the boundary condition of Equation 1.36 and referring to
Fig. 1.4 (in which $\mathbf{H}$ can be considered to replace $\mathbf{E}$) it is seen that

$$H'_x = H_{1l} \cos I - H_{2l} \cos I = H''_x = H_{3l} \cos R \tag{1.51}$$

Equation 1.32 shows that

$$H_{1l} = \sqrt{\frac{\varepsilon_1}{\mu_0}}\, E_{1r}$$

$$H_{2l} = \sqrt{\frac{\varepsilon_1}{\mu_0}}\, E_{2r}$$

and

$$H_{3l} = \sqrt{\frac{\varepsilon_2}{\mu_0}}\, E_{3r}$$

Therefore, substituting in Equation 1.51,

$$(E_{1r} - E_{2r}) \cos I = (E_{3r} \cos R) \sqrt{\frac{\varepsilon_2}{\varepsilon_1}} = n E_{3r} \cos R \tag{1.52}$$

where $n$ is the refractive index of medium 2 relative to that of
medium 1. The boundary condition of Equation 1.34 gives

$$E'_x = (E_{1l} - E_{2l}) \cos I = E''_x = E_{3l} \cos R \tag{1.53}$$

whilst the boundary condition of Equation 1.37 gives

$$H_{1r} + H_{2r} = H_{3r} \tag{1.54}$$

Substituting $H_{1r} = \sqrt{\varepsilon_1/\mu_0}\, E_{1l}$, $H_{2r} = \sqrt{\varepsilon_1/\mu_0}\, E_{2l}$ and
$H_{3r} = \sqrt{\varepsilon_2/\mu_0}\, E_{3l}$ derived from Equation 1.32, it follows that

$$E_{1l} + E_{2l} = \sqrt{\frac{\varepsilon_2}{\varepsilon_1}}\, E_{3l} = n E_{3l} \tag{1.55}$$



18 UNIVERSITY OPTICS 

The results following from the application of the boundary condi- 
tions are thus: 

Eir+Efr = E% r 

(E lr —E 2r ) cos / = nE 3r cos R 
(EU—E21) cos / = E 3 i cos jR 
Ev+Ezi = nEsi 

If the expressions for electric field components given by Equa- 
tions 1.47, 1.48 and 1.49 are substituted in these four equations, 
a simplification results from the fact that at the boundary y = 0; 
the exponential factors in Equations 1.47, 1.48 and 1.49 are then all 
equal because as/3 = a/i, as shown in Section 1.6. For convenience, 
theamplitudes£io/,.Eior,£'2o/, • • • ofE u ,E lr ,E 2l , ... are replaced 
by Ay,, Ai r ,A 2l , • • • Then the amplitude values for Equations 1.50, 
1.52, 1.53 and 1.55 become, respectively: 

A lr +A 2r = A 3r (1.56) 

(A lr -A2r) cos I = nA 3r cos R (1-57) 

{An- An) cos / = An cos R (1.58) 

Au+A 2l = nA 3 , (1.59) 

Equation 1.57 divided by Equation 1.56 gives

$$(A_{1r} - A_{2r}) \cos I = n(A_{1r} + A_{2r}) \cos R$$

Therefore

$$A_{2r} = A_{1r}\,\frac{\cos I - n \cos R}{n \cos R + \cos I}$$

Substituting $n = (\sin I)/(\sin R)$ from Snell's law,

$$A_{2r} = A_{1r}\,\frac{\cos I \sin R - \sin I \cos R}{\sin I \cos R + \cos I \sin R}$$

Thus

$$A_{2r} = -A_{1r}\,\frac{\sin (I - R)}{\sin (I + R)} \tag{1.60}$$

which relates the amplitude of the perpendicular component of the 
electric vector in the reflected beam to that in the incident beam. 
Equation 1.58 divided by Equation 1.59 gives

$$n(A_{1l} - A_{2l}) \cos I = (A_{1l} + A_{2l}) \cos R$$

Substituting $n = (\sin I)/(\sin R)$,

$$A_{2l} = A_{1l}\,\frac{\sin I \cos I - \sin R \cos R}{\sin R \cos R + \sin I \cos I}$$

Therefore

$$A_{2l} = A_{1l}\,\frac{\sin 2I - \sin 2R}{\sin 2I + \sin 2R}$$

i.e.

$$A_{2l} = A_{1l}\,\frac{\tan (I - R)}{\tan (I + R)} \tag{1.61}$$

which relates the amplitude of the parallel component of the electric 
vector in the reflected beam to that in the incident beam. 

To relate the amplitude $A_{3r}$ of the perpendicular component of
the refracted beam to that in the incident beam, substitution of
$A_{2r}$ from Equation 1.60 in Equation 1.56 gives

$$A_{3r} = A_{1r}\left[1 - \frac{\sin (I - R)}{\sin (I + R)}\right] = A_{1r}\,\frac{\sin (I + R) - \sin (I - R)}{\sin (I + R)}$$

Thus

$$A_{3r} = A_{1r}\,\frac{2 \sin R \cos I}{\sin (I + R)} \tag{1.62}$$

Finally, to relate the amplitude $A_{3l}$ of the parallel component of
the refracted beam to that in the incident beam, substitution of $A_{2l}$
from Equation 1.61 in Equation 1.58 gives

$$A_{3l} = A_{1l}\,\frac{\cos I}{\cos R}\left(1 - \frac{\sin 2I - \sin 2R}{\sin 2I + \sin 2R}\right) = A_{1l}\,\frac{2 \sin 2R \cos I}{\cos R\,(\sin 2I + \sin 2R)}$$

Thus

$$A_{3l} = A_{1l}\,\frac{4 \sin R \cos I}{\sin 2I + \sin 2R} \tag{1.63}$$

Equations 1.60, 1.61, 1.62 and 1.63 are the Fresnel equations. 
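
The four Fresnel equations can be evaluated numerically as a check on the algebra. The following is a minimal sketch, not part of the original treatment: the function name `fresnel_amplitudes` and its argument names are illustrative choices, unit incident amplitudes are assumed, and the angle of incidence must lie strictly between 0° and 90° so that none of the expressions becomes indeterminate.

```python
import math

def fresnel_amplitudes(incidence_deg, n):
    """Amplitude ratios for unit incident amplitudes (Equations 1.60 to 1.63).

    Returns (A2r, A2l, A3r, A3l) for an angle of incidence I strictly
    between 0 and 90 degrees; n is the refractive index of medium 2
    relative to that of medium 1.
    """
    I = math.radians(incidence_deg)
    R = math.asin(math.sin(I) / n)  # Snell's law: sin I = n sin R
    a2r = -math.sin(I - R) / math.sin(I + R)                # Equation 1.60
    a2l = math.tan(I - R) / math.tan(I + R)                 # Equation 1.61
    a3r = 2 * math.sin(R) * math.cos(I) / math.sin(I + R)   # Equation 1.62
    a3l = (4 * math.sin(R) * math.cos(I)
           / (math.sin(2 * I) + math.sin(2 * R)))           # Equation 1.63
    return a2r, a2l, a3r, a3l

# Example: air to glass (n = 1.5) at 30 degrees incidence.
print(fresnel_amplitudes(30.0, 1.5))
```

Substituting the returned values back into Equations 1.56 to 1.59 (with the incident amplitudes set to unity) is a convenient way of confirming that the boundary conditions are satisfied.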

The situation where the light is at normal incidence is of special
interest, since then I = R = 0°. Fresnel's equations, as derived
in the general case above, are thus indeterminate. For example,
Equation 1.60 immediately leads to

$$A_{2r} = -A_{1r}\,\frac{0}{0}$$

To explore normal incidence, it is therefore necessary to return
to the initial Equations 1.56, 1.57, 1.58 and 1.59. With I = R = 0°
(i.e. cos I = cos R = 1), these equations become respectively:

$$A_{1r} + A_{2r} = A_{3r} \tag{1.64}$$

$$A_{1r} - A_{2r} = n A_{3r} \tag{1.65}$$

$$A_{1l} - A_{2l} = A_{3l} \tag{1.66}$$

$$A_{1l} + A_{2l} = n A_{3l} \tag{1.67}$$

From Equations 1.64 and 1.65, eliminating $A_{3r}$,

$$n(A_{1r} + A_{2r}) = A_{1r} - A_{2r}$$

Thus

$$A_{2r} = -A_{1r}\,\frac{n - 1}{n + 1} \tag{1.68}$$

where $n$ is the refractive index of medium 2 with respect to that of
medium 1, as before. Similarly, eliminating $A_{3l}$ from Equations
1.66 and 1.67,

$$n(A_{1l} - A_{2l}) = A_{1l} + A_{2l}$$

Therefore

$$A_{2l} = A_{1l}\,\frac{n - 1}{n + 1} \tag{1.69}$$

Inserting $A_{2r}$ from Equation 1.68 in Equation 1.64,

$$A_{3r} = A_{1r}\left(1 - \frac{n - 1}{n + 1}\right) = A_{1r}\,\frac{2}{n + 1} \tag{1.70}$$

and inserting $A_{2l}$ from Equation 1.69 in Equation 1.66,

$$A_{3l} = A_{1l}\left(1 - \frac{n - 1}{n + 1}\right) = A_{1l}\,\frac{2}{n + 1} \tag{1.71}$$

Equations 1.68, 1.69, 1.70 and 1.71 are Fresnel's equations for 
normal incidence. 

With normal incidence, the reflection coefficient (defined as
the ratio of the intensity of the reflected beam to the intensity of
the incident beam) is easily worked out. Recalling that intensity is
proportional to the square of the amplitude, it is clear that the
intensity of the incident beam is proportional to $A_{1r}^2 + A_{1l}^2$, whereas
the intensity of the reflected beam is proportional to $A_{2r}^2 + A_{2l}^2$.
Therefore

$$\text{reflection coefficient} = \frac{A_{2r}^2 + A_{2l}^2}{A_{1r}^2 + A_{1l}^2}$$

Substitution from Equations 1.68 and 1.69 gives

$$\text{reflection coefficient} = \frac{A_{1r}^2[(n - 1)/(n + 1)]^2 + A_{1l}^2[(n - 1)/(n + 1)]^2}{A_{1r}^2 + A_{1l}^2}$$

Therefore

$$\text{reflection coefficient} = \left(\frac{n - 1}{n + 1}\right)^2 \tag{1.72}$$

With light incident normally in air on glass of refractive index
1.5, the reflection is therefore $(0.5/2.5)^2 \times 100 = 4\%$.
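
The normal-incidence results lend themselves to a quick numerical check. The sketch below simply evaluates Equations 1.68 to 1.72 for a given relative refractive index; the function name `normal_incidence` is an illustrative choice, and the second index in the loop (1.33, roughly that of water) is an assumed value added only for comparison.

```python
def normal_incidence(n):
    """Amplitude ratios and reflection coefficient at normal incidence.

    n is the refractive index of medium 2 relative to that of medium 1.
    Returns (A2r/A1r, A2l/A1l, A3/A1, reflection coefficient).
    """
    reflected_perpendicular = -(n - 1) / (n + 1)        # Equation 1.68
    reflected_parallel = (n - 1) / (n + 1)              # Equation 1.69
    refracted = 2 / (n + 1)                             # Equations 1.70 and 1.71
    reflection_coefficient = ((n - 1) / (n + 1)) ** 2   # Equation 1.72
    return (reflected_perpendicular, reflected_parallel,
            refracted, reflection_coefficient)

# 1.5 is the glass of the text; 1.33 is an assumed extra example.
for n in (1.5, 1.33):
    print(n, normal_incidence(n))
```

For n = 1.5 the amplitude ratios come out as -0.2, 0.2 and 0.8, and the reflection coefficient as 0.04, in agreement with the 4% quoted above.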



1.8 The Reflection and Refraction of Light at a Boundary: Practical 
Results 

Fresnel's equations, deduced in Section 1.7, give the amplitudes 
of the parallel components of the electric field vectors in the re- 
flected and refracted light in terms of the amplitude of the parallel 
component in the incident light, and similarly for the perpendicular 
components. In practice, the incident light may be unpolarised, 
partly polarised or fully polarised. Whatever the state of polar- 
isation, the electric vector can be resolved into parallel and perpen- 
dicular components. If the electric vector amplitudes of these 
components are known, those in the reflected and refracted light 
can be calculated by means of Fresnel's equations. 

As an example, let the amplitudes $A_{1l}$ and $A_{1r}$ of the parallel and
perpendicular components in the incident beam of light both be unity,
and suppose incidence is in air on a polished glass surface of
refractive index n = 1.5. It is required to plot the amplitudes $A_{2l}$
and $A_{2r}$ in the reflected beam, and $A_{3l}$ and $A_{3r}$ in the refracted
beam against the angle of incidence I.

It should be noted that the case where $A_{1l}$ and $A_{1r}$ are equal
is of interest when the incident light is unpolarised, because
unpolarised light can be resolved into components of equal amplitude.

Curve 1 of Fig. 1.5 shows $A_{2l}$ against I. For I = 0°, it is seen
from Equation 1.69 that $A_{2l}$ = 0.2. As I increases, $A_{2l}$ decreases
(as is shown by Equation 1.61) to reach zero when the denominator
of Equation 1.61 is infinite; i.e. I + R = 90° when tan 90° = ∞.
At this particular angle of incidence, the electric vector vibration 
in the plane of incidence vanishes and only the perpendicular 
component A 2r remains. Therefore, when the reflected and re- 
fracted rays are at right angles to one another, the reflected beam 
is plane-polarised with its electric vector vibrating perpendicular 
to the plane of incidence. This occurs for a particular angle of
incidence $I_B$. As $(\sin I_B)/(\sin R) = n$ (Snell's law) and $R = 90° - I_B$,

$$\frac{\sin I_B}{\cos I_B} = \tan I_B = n \tag{1.73}$$

where $I_B$ is the Brewster angle of incidence (Section 3.2). With
n = 1.5, this Brewster angle is $\tan^{-1} 1.5$ = 56.3°.
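
As a short illustration of Equation 1.73, the following sketch computes the Brewster angle for a given relative refractive index and confirms that the reflected and refracted rays are then at right angles. The function names are illustrative and not taken from the text.

```python
import math

def brewster_angle_deg(n):
    """Brewster angle in degrees from tan I_B = n (Equation 1.73)."""
    return math.degrees(math.atan(n))

def refraction_angle_deg(incidence_deg, n):
    """Angle of refraction in degrees from Snell's law, sin I = n sin R."""
    return math.degrees(math.asin(math.sin(math.radians(incidence_deg)) / n))

i_b = brewster_angle_deg(1.5)
r_b = refraction_angle_deg(i_b, 1.5)
print(f"I_B = {i_b:.1f} deg, R = {r_b:.1f} deg, I_B + R = {i_b + r_b:.1f} deg")
# prints: I_B = 56.3 deg, R = 33.7 deg, I_B + R = 90.0 deg
```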



[Figure: curves 1 to 4 show $A_{2l}$, $A_{2r}$, $A_{3l}$ and $A_{3r}$ respectively; horizontal axis: angle of incidence, I (degrees), from 0 to 90]

Fig. 1.5. Variation with angle of incidence of amplitudes of parallel and
perpendicular components of reflected and refracted beams (n = 1.5 and the
amplitudes of components in the incident beam are both unity)

For angles of incidence I greater than $I_B$, I + R exceeds 90° and
tan (I + R) is negative. Equation 1.61 shows that $A_{2l}/A_{1l}$ is then
negative: the direction of the electric vector is reversed and the
vibrations in the reflected light are 180° out of phase with those
of the incident light. As I increases to 90°, corresponding to grazing
incidence, $A_{2l}$ becomes equal to $-1.0$.

The plot of $A_{2r}$ against I is shown in curve 2 of Fig. 1.5. Equation
1.60 shows that $A_{2r}/A_{1r}$ is always negative, so the phase is reversed.
At I = 0°, $A_{2r} = -0.2$ (Equation 1.68); and as I increases, $A_{2r}$
increases in magnitude to become $-1.0$ at I = 90°.

Curve 3 of Fig. 1.5 shows $A_{3l}$ against I. Equation 1.63 is involved.
At I = 0°, $A_{3l}$ = 0.8 (Equation 1.71); $A_{3l}/A_{1l}$ is always positive,
and $A_{3l}$ decreases to zero at I = 90°.

The plot of $A_{3r}$ against I is shown in curve 4 of Fig. 1.5.
As is seen from Equation 1.62, the variation of $A_{3r}$ departs only
slightly from the similar curve for $A_{3l}$.

From these results, the reflection coefficients for various angles
of incidence may be obtained, since intensity is proportional to
the square of the amplitude. Thus, from Fig. 1.5, where the
components $A_{1l}$ and $A_{1r}$ are both unity, the reflected intensities are
given by plotting $A_{2l}^2$ and $A_{2r}^2$ against I.
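
The reflected intensities just described can be tabulated directly from Equations 1.60 and 1.61. The sketch below does this for n = 1.5 with unit incident amplitudes; the function name and the particular angles chosen are illustrative assumptions, and the angle of incidence must lie strictly between 0° and 90°.

```python
import math

def reflected_intensities(incidence_deg, n):
    """Return (A2l**2, A2r**2) for unit incident amplitudes, 0 < I < 90 deg."""
    I = math.radians(incidence_deg)
    R = math.asin(math.sin(I) / n)             # Snell's law
    a2r = -math.sin(I - R) / math.sin(I + R)   # Equation 1.60
    a2l = math.tan(I - R) / math.tan(I + R)    # Equation 1.61
    return a2l ** 2, a2r ** 2

for angle in (10, 30, 56.31, 75, 89):
    parallel, perpendicular = reflected_intensities(angle, 1.5)
    print(f"I = {angle:5.2f} deg  A2l^2 = {parallel:.4f}  A2r^2 = {perpendicular:.4f}")
```

The parallel intensity falls to zero near the Brewster angle and then rises towards unity at grazing incidence, whereas the perpendicular intensity rises steadily throughout.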

The statement is often made that there is a phase change of 180°
when light is reflected at the surface of an optically denser medium,
i.e. the light is incident in a medium of refractive index $n_1$ and
encounters the surface of a medium of refractive index $n_2$, where
$n_2 > n_1$. Study of one of Fresnel's equations (Equation 1.60)
shows that this is always true for the electric vector components
perpendicular to the plane of incidence (see also curve 2 of Fig. 1.5),
but is only true for the components parallel to the plane of incidence
if the angle of incidence exceeds the Brewster polarising
angle, as is shown by Equation 1.61 for I + R > 90°. Thus, for
angles of incidence I less than the Brewster angle, $A_{2l}/A_{1l}$ is positive
and apparently the electric vector in the reflected beam is in phase
with that in the incident beam.

However, in Fig. 1.6 let AO be a beam of light at angle of incidence
$I_1$, less than the Brewster angle, the incidence being near
normal. Suppose the arrow perpendicular to AO represents $E_{1l}$,
the electric vector in the incident beam. This is assumed to be in the
positive direction, so is drawn to the right-hand side of the direction
of the incident light. In the reflected beam OB, $E_{2l}$ is again positive
(no phase change) and is again drawn to the right-hand side of
the direction OB. But the light direction is reversed on reflection.
Hence, to an observer 1 receiving light in the direction roughly
along the normal ON, the vibrations in the incident and reflected
beams would appear to be in anti-phase.

Fig. 1.6. Phase change on reflection

Furthermore, when the incident beam CO is at an angle of
incidence $I_2$ which exceeds the Brewster angle, $I_2$ being nearly 90°,
$E_{1l}$ is drawn to the right of CO if it is positive; on the other hand,
$E_{2l}$ in the reflected beam OD would be drawn to the left of OD,
as it will be negative because $E_{2l}/E_{1l}$ is negative when I + R > 90°
(Equation 1.61). To an observer 2 receiving light resulting from
reflection with grazing incidence, the electric vector vibrations will
appear to be in anti-phase.

The initial statement about the phase change of 180° is therefore 
true for the parallel components as well as the perpendicular ones 
if the viewing conditions are appropriate and, indeed, are those 
which usually prevail in interference experiments between coherent 
incident and reflected beams of light. 



1.9 Electromagnetic Theory of the Scattering of Light 

The electromagnetic theory cannot give a detailed explanation 
of the effects of the scattering of light by molecules of a material; 
instead, quantum theory is required. In quantum theory, the light 
is considered to be propagated in the form of photons, where the 
energy of a photon at frequency v is hv, h being Planck's constant. 
Certain important aspects of the scattering of light can, however, 
be satisfactorily discussed by the use of electromagnetic theory. 
The basic situation is that the scattering material concerned is a 
dielectric consisting of an assembly of molecular electric dipoles; 
the electric vector alternating in magnitude in the impinging electro- 
magnetic wave will be able to set the bound charges in the dipoles 
in motion. To gain an insight into the ideas involved, consider 
the dielectric to be water. The hydrogen atoms have positive nuclei 
each of charge +e, where e is the electronic charge. The oxygen
atom has a nucleus of charge +8e. These three nuclei will have
an equivalent 'seat' of positive charge +10e somewhere within
the molecule. The electron in orbit around each hydrogen atom
will be intermingled in its motion with the outermost electrons
of the oxygen atom. The molecule is electrically uncharged as a
whole, because the +10e charge on the nuclei is balanced by the
-10e charge on the electrons. The equivalent 'seat' of negative
charge due to the electrons does not coincide with the 'seat' of 
positive charge: there is a small distance between them. Hence the
water molecule is, electrically, a dipole, and has a dipole moment.
Many molecules such as water are permanent dipoles; many 
others have a zero or very small dipole moment which is increased 
on subjection to an electric field, i.e. these molecules become in- 
duced dipoles. 

The nuclei have much larger masses than the electrons, because
the proton mass is 1,836 $m_e$ and the neutron mass is 1,838 $m_e$,
where $m_e$ is the rest mass of the electron. When an electromagnetic
wave impinges on a molecule, it is consequently only the electrons 
which undergo any significant motion in the alternating electric 
field. In a dipole, such electrons are bound charges, not free as in a 
conductor. Therefore, any tendency for the electric field in the 
electromagnetic wave to set these electrons in motion is retarded by 
the attraction of the electron to the positive nuclei and also by 
forces between molecules. The intermolecular forces will be much 
stronger in solids and liquids, where the separations between 
molecules are of the order of $10^{-8}$ cm, than in gases where, at
atmospheric pressure, the separation is a few hundred times greater.
When the electromagnetic wave impinges on the dielectric
material, the bound electrons in the dipoles are set into motion
and they will oscillate under oscillatory forces in the direction of 
the alternating electric vector in the wave. It is fundamental to 
the production of electromagnetic radiation that it is a result of 
accelerated electric charge. The forced oscillations of the bound 
electrons will consequently result in electromagnetic radiation; 
this will be the scattered light. 

The bound electrons in the material will themselves have a 
natural frequency or, in general, a series of natural frequencies
of vibration. If the frequency of the impinging electromagnetic 
wave is not equal to the frequency of vibration of the bound 
electrons, the forced oscillations will be of small amplitude and 
subject to damping. The scattered radiation will therefore be 
weak and of the same frequency as that of the incident wave, but 
containing also harmonics. In certain circumstances, scattered 
radiation of lower frequency (longer wavelength) can also be pro- 
duced, as in the Raman effect (Section 2.22) and in fluorescence, 
but these effects cannot be adequately explained without recourse 
to quantum theory. 

The electromagnetic theory can also explain aspects of the 
distribution of intensity with direction of the scattered light. Let a 
plane-polarised electromagnetic wave with the electric vector E
vibrating along the y-axis and the magnetic vector H in the perpendicular
z-direction be travelling along the x-axis to encounter a
bound electron at point O (Fig. 1.7). The electron will execute
forced vibrations along the y-axis under the action of the oscillating
E and H. This vibration of the electron along the y-axis cannot
produce a transverse electromagnetic wave along the y-axis, so
the intensity of the scattered light along OA and OB will be zero.
The most intense scattered light will be along the radii from O 
terminating on the equatorial circle CEFG. This scattered light 
will be plane-polarised with its electric vector vibrations in a 
direction parallel to the vector E in the incident radiation. Let its 
amplitude be A at any point on the circle CEFG. At any other point 
P on the sphere around O, the component of the vibration of the 
electron projected perpendicular to OP will be effective. If angle
POF is $\theta$, the amplitude of the wave in the direction at angle $\theta$ to the
plane CEFG is $A \cos \theta$. Hence, the intensity varies as $\cos^2 \theta$.

Fig. 1.7. Distribution of scattered light around a bound charge at O due to
plane-polarised light incident along the x-axis (circle CEFG is in the plane xOz)

If the light incident along x on the electron at O is unpolarised, 
in a finite time the vector E will be equally likely to occupy any 
direction in a plane perpendicular to the direction of propagation 
along the x-axis. The light scattered along the x-axis is likewise 
unpolarised, whereas that scattered in any direction in the plane 
yOz will clearly be plane-polarised. 

These predictions assume that the axis of the induced dipole 
is in the direction of the electric vector in the incident radiation. 
This is not necessarily so; if it is not, the scattered light will be 
partially polarised. 



1.10 The Electromagnetic Theory of the Absorption of Light 

As in scattering, a full account on the basis of electromagnetic 
theory is inadequate and the quantum theory has to be applied. 
Again, important ideas can nevertheless be put forward by utilising 
electromagnetic theory. 




The situation is as in Section 1.9 in that the forced vibrations 
induced in the bound charge in a dielectric are concerned, but 
free electrons in a conductor must now also be considered. The 
strongest absorption will occur when the frequency of vibrations 
in the incident electromagnetic wave is equal to a natural fre- 
quency of vibration of a bound charge in the dielectric. Resonance 
will then occur, the essential origin of absorption spectra being 
concerned here. For a conductor, the free electrons available will 
be vibrated in the direction of the oscillatory electric vector, and 
this will occur largely irrespective of the frequency of the incident 
radiation. The induced electric currents resulting will cause energy 
dissipation, and strong absorption of the incident light energy will 
occur. 

1.11 The Theory of Dispersion 

Electromagnetic theory leads to Equation 1.33

$$n = \sqrt{\varepsilon_r}$$

for the refractive index n of a transparent dielectric medium of
relative permittivity $\varepsilon_r$ and a relative permeability of unity. There
is no immediate indication of a variation of refractive index with
the frequency $\nu$ of the radiation, i.e. no indication of dispersion.
Lorenz in 1880 and Lorentz in 1909 developed the electro- 
magnetic theory of dielectrics to show that dispersion came about 
because of the molecular dipole structure of materials, i.e. because 
of the presence of bound electrons. In the 'macroscopic' electro- 
magnetic theory, the setting in motion of bound charges on the 
passage of an electromagnetic wave is not taken into account. 
The interaction of light with matter on the molecular scale requires 
explanation in terms of quantum theory, but the Lorenz-Lorentz 
development of classical electromagnetic theory leads to valuable 
results. 

To give a simplified account of this Lorenz-Lorentz theory, 
suppose an electric field E causes electric polarisation P of a 
medium of relative permittivity $\varepsilon_r$. In rationalised units, a
well-known result from electrostatics is

$$D = \varepsilon_0 E + P = \varepsilon_0 \varepsilon_r E \tag{1.74}$$

where D is the electric displacement, $\varepsilon_0$ is the permittivity of free
space, and P (the polarisation) is the electric moment per unit
volume of the dielectric. Therefore

$$\varepsilon_r = 1 + \frac{P}{\varepsilon_0 E}$$

and Equation 1.33 gives

$$n^2 = 1 + \frac{P}{\varepsilon_0 E} \tag{1.75}$$

Suppose the polarising electric field E at time t is caused by a
plane electromagnetic wave passing through the dielectric medium,
where

$$E = E_0 \exp j\omega t$$

$E_0$ being the peak value of the electric vector in the wave, and
$\omega = 2\pi\nu$ where $\nu$ is the frequency of the wave. This alternating
electric field E sets the bound electrons in the molecular dipoles
in motion. The electron is subject to an elastic restoring force
assumed to be proportional to its displacement x from its
equilibrium position. This restoring force will be given by $m_e \omega_0^2 x$
where $\omega_0/2\pi = \nu_0$ is the natural frequency of vibration of the
electron in the material. In addition, the electron will be subject to a
resisting or damping force assumed to be r per unit mass per unit
velocity. The equation of motion of the electrons, each of charge
e and mass $m_e$, will therefore be

$$m_e(\ddot{x} + r\dot{x} + \omega_0^2 x) = Ee$$

If r is assumed to be negligibly small for simplicity, then

$$\ddot{x} + \omega_0^2 x = \frac{Ee}{m_e} \tag{1.76}$$

Putting $x = x_0 \exp j\omega t$, where $x_0$ is the peak displacement which
is assumed to have the same frequency as and to be in phase
with E,

$$\ddot{x} = -\omega^2 x_0 \exp j\omega t = -\omega^2 x$$

and Equation 1.76 becomes

$$x(\omega_0^2 - \omega^2) = \frac{Ee}{m_e} \tag{1.77}$$

The dipole moment due to the displacement of one electron through
a distance x is ex. If there are N electrons per unit volume, the total
electric moment per unit volume = P = Nex. Substituting for x
from Equation 1.77,

$$P = \frac{Ne^2 E}{m_e(\omega_0^2 - \omega^2)}$$

Substituting this expression for P in Equation 1.75,

$$n^2 = 1 + \frac{Ne^2}{m_e \varepsilon_0 (\omega_0^2 - \omega^2)}$$

or, putting $\omega_0 = 2\pi\nu_0$ and $\omega = 2\pi\nu$,

$$n^2 = 1 + \frac{Ne^2}{4\pi^2 m_e \varepsilon_0 (\nu_0^2 - \nu^2)} \tag{1.78}$$

If it is assumed that $Ne^2$ is small, i.e. that P is small compared with
$\varepsilon_0 E$ in Equation 1.75,

$$n = 1 + \frac{Ne^2}{8\pi^2 m_e \varepsilon_0 (\nu_0^2 - \nu^2)} \tag{1.79}$$



[Figure: theoretical graphs (damping ignored) of refractive index against frequency, with marked absorption at frequency $\nu_0$]

Fig. 1.8. Variation of refractive index with frequency



This equation gives the variation of refractive index n with
frequency $\nu$ of the radiation. At $\nu = \nu_0$, n becomes infinite. This is
because of strong absorption of the radiation at the frequency $\nu_0$,
corresponding to a line in the absorption spectrum of the material.
A more complex theory including, in particular, a damping term
represented by a finite value of r, the resisting force per unit mass
per unit velocity, would show the discontinuity in the plot of n
against $\nu$ at $\nu = \nu_0$ but without n becoming infinite. The full line
in Fig. 1.8 shows a typical plot from Equation 1.79. The abrupt
discontinuity at the absorption line is not experienced in practice;
the dashed curve shows the experimental result obtained in the
region of the absorption.

Equation 1.78 and Fig. 1.8 apply to a gas in which the electric 
polarisation P would be weak; this would be the case except at 
high pressure. It is also assumed that only one absorption band is 
present. Nevertheless, Fig. 1.8 illustrates well the marked change 
in refractive index on either side of the absorption line. 
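
The behaviour of Equations 1.78 and 1.79 on either side of the resonance can be seen by evaluating them numerically. The sketch below is illustrative only: the electron density `N_ASSUMED` and resonance frequency `NU0_ASSUMED` are hypothetical values chosen to resemble a gas with a single ultra-violet absorption line, the physical constants are rounded, and damping is ignored as in the text.

```python
import math

E_CHARGE = 1.602e-19   # electronic charge e, C
E_MASS = 9.109e-31     # electron rest mass m_e, kg
EPSILON_0 = 8.854e-12  # permittivity of free space, F/m

def n_squared(nu, n_electrons, nu0):
    """Square of the refractive index from Equation 1.78 (damping ignored)."""
    return 1 + n_electrons * E_CHARGE ** 2 / (
        4 * math.pi ** 2 * E_MASS * EPSILON_0 * (nu0 ** 2 - nu ** 2))

def n_weak(nu, n_electrons, nu0):
    """Refractive index from Equation 1.79 (P small compared with eps0 E)."""
    return 1 + n_electrons * E_CHARGE ** 2 / (
        8 * math.pi ** 2 * E_MASS * EPSILON_0 * (nu0 ** 2 - nu ** 2))

N_ASSUMED = 1.0e25     # electrons per m^3 (hypothetical, gas-like)
NU0_ASSUMED = 3.0e15   # resonance frequency in Hz (hypothetical, ultra-violet)

for nu in (4.0e14, 6.0e14, 2.0e15, 2.9e15, 3.1e15, 4.0e15):
    print(f"nu = {nu:.1e} Hz  n = {n_weak(nu, N_ASSUMED, NU0_ASSUMED):.6f}")
```

Below the resonance the index exceeds unity and rises with frequency; just above the resonance it is less than unity, which is the discontinuity shown by the full line in Fig. 1.8.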




The variation of refractive index n with wavelength $\lambda$ is often
represented by the empirical equation of Cauchy

$$n = A + \frac{B}{\lambda^2} \tag{1.80}$$

where A and B are constants for a given material. The dispersion
$dn/d\lambda$ is given by

$$\frac{dn}{d\lambda} = -\frac{2B}{\lambda^3}$$

This equation represents fairly satisfactorily normal dispersion, 
for which n decreases with increase of wavelength and so decrease 
of frequency. In Fig. 1.8, n decreases with decrease of frequency
for curves 1 and 2, but at frequencies $\nu$ in the neighbourhood of $\nu_0$,
the resonant absorption frequency, the dashed curve 3 shows that
the refractive index n decreases with increase of frequency. This effect 
is known as anomalous dispersion and is exhibited by materials in 
the neighbourhood of frequencies at which absorption is marked. 
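
Cauchy's equation and the dispersion derived from it are easily evaluated. In the sketch below the constants `A_ASSUMED` and `B_ASSUMED` are hypothetical values of the order found for an optical glass, not data from the text, and wavelengths are in metres.

```python
def cauchy_index(wavelength, a, b):
    """Cauchy's equation (Equation 1.80): n = A + B / wavelength**2."""
    return a + b / wavelength ** 2

def cauchy_dispersion(wavelength, b):
    """Dispersion dn/d(lambda) = -2B / wavelength**3."""
    return -2 * b / wavelength ** 3

A_ASSUMED = 1.50      # dimensionless (hypothetical)
B_ASSUMED = 4.0e-15   # m^2 (hypothetical)

for wavelength in (400e-9, 589e-9, 700e-9):
    n = cauchy_index(wavelength, A_ASSUMED, B_ASSUMED)
    slope = cauchy_dispersion(wavelength, B_ASSUMED)
    print(f"{wavelength * 1e9:.0f} nm  n = {n:.4f}  dn/dlambda = {slope:.3e} per m")
```

The index falls as the wavelength increases and the slope is negative throughout, as expected for normal dispersion.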

Many common optical media (for example, glasses and water) 
do not have electron resonance frequencies in the visible region, 
so for practical optical purposes they exhibit normal dispersion. 
They will show anomalous dispersion in the ultra-violet and infra- 
red regions where absorption is marked at particular frequencies. 
Sodium vapour, on the other hand, shows marked absorption at
electron resonance frequencies corresponding to wavelengths of
5,890 Å and 5,896 Å. Sodium vapour will therefore exhibit anomalous
dispersion in the visible spectrum in the region around
5,893 Å.

Replacing $\nu$ by $c/\lambda$ and $\nu_0$ by $c/\lambda_0$ in Equation 1.78, where c is
the velocity of light in free space whilst $\lambda$ and $\lambda_0$ are wavelengths
(in free space) corresponding to $\nu$ and $\nu_0$ respectively, gives

$$n^2 = 1 + \frac{Ne^2}{4\pi^2 m_e \varepsilon_0 [(c^2/\lambda_0^2) - (c^2/\lambda^2)]} = 1 + \frac{Ne^2 \lambda_0^2 \lambda^2}{4\pi^2 m_e \varepsilon_0 c^2 (\lambda^2 - \lambda_0^2)}$$

This assumes that the material has only one absorption band at
the wavelength $\lambda_0$, which is a constant for a given dielectric.
Therefore

$$n^2 = 1 + \frac{A' \lambda^2}{\lambda^2 - \lambda_0^2} \tag{1.81}$$

where A' is a constant for a given material. Equation 1.81 is
Sellmeier's formula.






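If the wavelength $\lambda_0$ is far removed on the short wavelength
side from the region where the dispersion is required, $\lambda^2 \gg \lambda_0^2$; so

$$n^2 = 1 + A'\left(1 - \frac{\lambda_0^2}{\lambda^2}\right)^{-1} \approx 1 + A' + \frac{A' \lambda_0^2}{\lambda^2}$$

Therefore, if A' is also small compared with unity,

$$n \approx 1 + \frac{A'}{2} + \frac{A' \lambda_0^2}{2\lambda^2}$$

which is of the same form as Cauchy's equation where, in Equation
1.80, $A = 1 + A'/2$ and $B = A' \lambda_0^2/2$.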




Fig. 1.9. Normal and anomalous dispersion of quartz 



In general, there will be a number of absorption bands in the
spectrum at wavelengths $\lambda_1, \lambda_2, \lambda_3, \ldots$ Equations 1.78, 1.79 and
1.81 then have to be modified. In particular, Sellmeier's equation
will be of the form

$$n^2 = 1 + \sum_j \frac{A_j \lambda^2}{\lambda^2 - \lambda_j^2} \tag{1.82}$$

where $A_j$ is proportional to the number of electrons of which the
natural vibration frequency corresponds to a wavelength in free
space of $\lambda_j$.
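
A minimal sketch of how Equation 1.82 might be evaluated is given below. The two absorption bands in `TERMS_ASSUMED` (one in the ultra-violet, one in the infra-red) are hypothetical and do not describe any particular material; wavelengths are in nanometres, and damping is ignored, so the formula cannot be used at or very close to a band.

```python
import math

def sellmeier_index(wavelength, terms):
    """Refractive index from Equation 1.82.

    terms is a list of (A_j, lambda_j) pairs; wavelength and lambda_j
    must be in the same units. Where the sum makes n squared negative
    there is no real index, and a ValueError is raised.
    """
    n_sq = 1 + sum(a_j * wavelength ** 2 / (wavelength ** 2 - lam_j ** 2)
                   for a_j, lam_j in terms)
    if n_sq < 0:
        raise ValueError("n squared is negative at this wavelength")
    return math.sqrt(n_sq)

# Hypothetical bands: (A_j, lambda_j in nm).
TERMS_ASSUMED = [(1.25, 100.0), (0.5, 9000.0)]

for wavelength in (400.0, 500.0, 600.0, 700.0):
    print(f"{wavelength:.0f} nm  n = {sellmeier_index(wavelength, TERMS_ASSUMED):.4f}")
```

Between the two bands the index falls steadily as the wavelength increases, which is the normal dispersion of a transparent medium in the visible region.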

Equation 1.78 is of interest in connection with X-rays. The
frequencies involved in X-radiation are much larger than electron
resonance frequencies, so $\nu \gg \nu_0$. Therefore, the term

$$\frac{Ne^2}{4\pi^2 m_e \varepsilon_0 (\nu_0^2 - \nu^2)} \approx \frac{-Ne^2}{4\pi^2 m_e \varepsilon_0 \nu^2}$$

is negative and the refractive index n is less than unity.
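
To give a feeling for the size of the effect, the sketch below estimates how far n falls below unity for X-rays, using the limiting term above together with the fact that the term is small, so that n is approximately 1 minus half of its magnitude. The electron density `N_ASSUMED` and frequency `NU_ASSUMED` are hypothetical round figures, and the constants are rounded.

```python
import math

E_CHARGE = 1.602e-19   # electronic charge e, C
E_MASS = 9.109e-31     # electron rest mass m_e, kg
EPSILON_0 = 8.854e-12  # permittivity of free space, F/m

def xray_index_decrement(nu, n_electrons):
    """Return 1 - n for nu much greater than nu0.

    Uses n ~ 1 - N e^2 / (8 pi^2 m_e eps0 nu^2), i.e. half the limiting
    term of Equation 1.78, valid when that term is small.
    """
    return n_electrons * E_CHARGE ** 2 / (
        8 * math.pi ** 2 * E_MASS * EPSILON_0 * nu ** 2)

N_ASSUMED = 1.0e30    # electrons per m^3 (hypothetical, solid-like)
NU_ASSUMED = 3.0e18   # X-ray frequency in Hz (wavelength about 1 angstrom)
delta = xray_index_decrement(NU_ASSUMED, N_ASSUMED)
print(f"1 - n = {delta:.2e}")
```

With these assumed figures the index differs from unity by only a few parts in a million.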




Finally, a typical experimental result in which the refractive
index n is plotted against wavelength $\lambda$ for a transparent optical
medium having a marked absorption band in the infra-red is
exhibited by quartz (Fig. 1.9).



Exercise 1 

Note. In the exercises at the end of each chapter, the abbreviations in brackets
at the end of a question denote the university or institution concerned.
These abbreviations are as follows.

L.G.    B.Sc. General Degree of the University of London
L.P.    B.Sc. (Special) Physics Degree of the University of London
L.Anc.  Ancillary examination in Physics to the B.Sc. (Special) Chemistry Degree of the University of London
City    B.Sc. Degree in Applied Physics of the City University
M.P.    B.Sc. Honours Degree in Physics of the University of Manchester
Poly    B.Sc. Honours Degree in Physics (C.N.A.A.) of The Polytechnic, Regent Street, London, W.1

The values of physical constants needed in numerical examples are given on 
page 367. 

1. For a plane electromagnetic wave in a non-conducting medium, establish 
that in the wavefront at a given time the magnitude of the magnetic vector 
divided by the magnitude of the electric vector is equal to the square root 
of the permittivity of the medium divided by the square root of the permea- 
bility, and establish also that these two vectors are perpendicular to one an- 
other. 

2. Derive from the appropriate equations in electromagnetic theory for a di- 
electric medium of which the relative magnetic permeability is unity, that the 
refractive index is numerically equal to the square root of the relative per- 
mittivity. Comment on this result in relation to experimentally determined 
values of the refractive indices and relative permittivities for gaseous, liquid 
and solid dielectrics. 

3. Give a theoretical treatment which establishes that the conditions which 
apply at a plane boundary between two different transparent dielectric media 
in relation to the continuity of the components of the electric field lead to 
the laws of reflection and of refraction in optics. 

4. Derive Fresnel's equations relating the amplitudes of the reflected to those 
of the incident beam for reflection at a plane interface between two trans- 
parent dielectric media. 

A beam, which is plane polarised at 45° with the plane of incidence, falls on 
the interface. Describe how the state of polarisation of the reflected beam 
varies with the angle of reflection. (L.P.) 

5. Plane polarised light is incident on a plane surface separating two homoge- 
neous isotropic dielectrics, the light being incident from the medium of 
lower refractive index. Draw diagrams which show qualitatively the way in
which the amplitude, phase and intensity of the reflected disturbance vary
with the angle of incidence when the electric vector of the incident light lies 
(a) in the plane of incidence and (b) perpendicular to the plane of incidence. 
Derive an expression for the reflection coefficient at normal incidence in 
terms of refractive indices. (L.P.) 

6. Write an essay on the evidence supporting the idea that light consists of 
electromagnetic waves. (L.P.) 

7. Obtain an expression for the ratio of the amplitudes of the electric and 
magnetic vectors in a plane polarised light wave in a dielectric. 

Find the reflected intensity when the light is incident normally at a plane
surface of separation between air and diamond of refractive index 2.42. (L.P.)

8. What is meant by a displacement current and why is the concept adopted? 
A plane electromagnetic wave in free space is incident normally on the plane 
boundary of a homogeneous loss-free medium of magnetic permeability 
$\mu$ = 2 and dielectric constant $\varepsilon$ = 0.5. Find the reflection coefficient after
deriving the necessary formula. (L.P.)

9. Explain the terms normal dispersion and anomalous dispersion. Draw a rough 
graph to illustrate the dispersion of a dielectric throughout a spectral region 
containing absorption bands. 

Discuss briefly the classical electromagnetic theory of dispersion. (L.P.) 

10. Discuss the effects of resonance and relaxation processes on the passage of 
electromagnetic radiation through matter. (L.P.) 

11. The rate of flow of energy per second through unit area drawn within a plane
wavefront of electromagnetic radiation in which the electric intensity
is E V m$^{-1}$ and the magnetic intensity is H A m$^{-1}$ is given by

N = EH J m$^{-2}$ s$^{-1}$

Calculate the root-mean-square electric field strength in air at a distance 
of 5 m from a 10 kW lamp (assumed to be a point source) which is radiating 
uniformly in all directions. 

12. Describe the chief characteristics of the dispersive behaviour of insulating 
materials with respect to electromagnetic waves. 

Discuss the principal processes involved and explain in qualitative terms 
how refraction and dispersion are related to scattering. (L.P.)



CHAPTER 2 



Quantum Theory of Radiation 



2.1 Origins of the Quantum Theory 

The quantum theory began in 1900 with the famous theoretical 
work of Max Planck concerned with the distribution of energy in 
the spectrum of the radiation from a black-body at a temperature
T °K. This theory is normally studied within the subject of statistical
thermodynamics and so will not be developed fully here. Previous
work by Lord Rayleigh and Sir James Jeans attempted an explanation
in which the radiation energy per unit volume $\rho(\nu)$ at a
frequency $\nu$ was considered to be in dynamic equilibrium with that
possessed by the vibrating molecules in the walls of the black-body 
cavity. These vibrating molecules are dipoles and as such their 
electric charges in motion radiate electromagnetic energy. The term 
'harmonic oscillators' was used to describe them. In accordance with 
classical concepts, these harmonic oscillators were allowed to have 
any arbitrary energy which was a continuous function of the
temperature T. This approach, based on statistical thermodynamics
in which the energy per degree of freedom is $\tfrac{1}{2}kT$, where k is
Boltzmann's constant, led to an equation

$$\rho(\nu)\,d\nu = \frac{8\pi\nu^2}{c^3}\,kT\,d\nu$$

where $\rho(\nu)\,d\nu$ is the energy per unit volume in the frequency range $d\nu$.
The prediction is consequently that the energy density of the radia- 
tion increases progressively with increase of the frequency v of the 
radiation. 

The experimental results disagree profoundly with this theoretical 
prediction. As shown in Fig. 6.4 of Volume 1, there is a maximum
in the curve of $\rho(\nu)$ plotted against $\nu$. The energy density attains
a peak value at a particular frequency, depending on the tempera- 
ture of the black-body. At frequencies greater than this value, the 
energy density decreases with increase of frequency. 

This magnificent failure of classical physics led Max Planck to
postulate that the vibrating molecules (harmonic oscillators) cannot
have any arbitrary energy, but only energy values restricted to
integral multiples of a finite amount of energy ε, where

$$\varepsilon = h\nu$$

h being Planck's constant = 6.625 × 10⁻³⁴ J s. This concept that
energy can only occur in integral multiples of little packets or
quanta enabled Planck to deduce the equation

$$Q(\nu)\,d\nu = \frac{8\pi h\nu^{3}}{c^{3}}\,\frac{1}{\exp(h\nu/kT) - 1}\,d\nu \qquad (2.1)$$

A plot of Q(ν) against ν based on Equation 2.1 gives a curve which
agrees with that obtained experimentally. This triumph of the
concept of the quantisation of energy inaugurated a revolution in
the fundamentals of physics of phenomena on the atomic and
molecular scale of dimensions.

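The expression for Q(λ), where λ is the wavelength corresponding
to the frequency ν, is obtained on substituting ν = c/λ in
Equation 2.1, where the wavelength interval dλ is related to the
frequency interval dν by differentiation of the equation

$$\nu = \frac{c}{\lambda}$$

to give

$$d\nu = -\frac{c}{\lambda^{2}}\,d\lambda$$

The negative sign shows only that ν decreases as λ increases. Taking
the magnitudes of the intervals, Equation 2.1 can be expressed in the
form

$$Q(\lambda)\,d\lambda = \frac{8\pi hc}{\lambda^{5}}\,\frac{1}{\exp(hc/\lambda kT) - 1}\,d\lambda \qquad (2.2)$$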
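The contrast between the classical prediction and Equation 2.1 can be checked numerically. The following is a minimal sketch, not part of the original treatment: the value of Boltzmann's constant, the temperature and the scanning method are assumptions chosen only for illustration.

```python
import math

H = 6.625e-34   # Planck's constant, J s (value quoted in the text)
K = 1.381e-23   # Boltzmann's constant, J/K (assumed; not quoted in the text)
C = 3.0e8       # velocity of light in free space, m/s

def rayleigh_jeans(nu, T):
    """Classical energy density per unit frequency: 8*pi*nu^2*k*T/c^3."""
    return 8 * math.pi * nu**2 * K * T / C**3

def planck(nu, T):
    """Energy density per unit frequency from Equation 2.1."""
    x = H * nu / (K * T)
    return 8 * math.pi * H * nu**3 / C**3 / math.expm1(x)

def peak_frequency(T, steps=20000):
    """Locate the maximum of the Planck curve by a coarse scan (illustrative only)."""
    nu_max = 20 * K * T / H   # a frequency well beyond the peak
    grid = [nu_max * i / steps for i in range(1, steps + 1)]
    return max(grid, key=lambda nu: planck(nu, T))

if __name__ == "__main__":
    T = 2000.0   # hypothetical black-body temperature
    nu_peak = peak_frequency(T)
    print(f"Planck curve peaks near {nu_peak:.3e} Hz at T = {T:.0f} K")
    for nu in (0.01 * nu_peak, nu_peak, 5 * nu_peak):
        print(f"{nu:.3e} Hz: classical {rayleigh_jeans(nu, T):.3e}, Planck {planck(nu, T):.3e}")
```

At low frequencies the two expressions nearly coincide; at high frequencies the classical value keeps growing while the Planck value falls away, which is the behaviour found experimentally.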

This great pioneer work of Planck led subsequently to two further 
results which form substantial foundation stones of the quantum 
theory : the first was the theoretical work of Einstein in 1905 on the 
photoelectric effect; the second, in 1913, was the theory of Niels 
Bohr of the spectrum of the hydrogen atom. 




2.2 Einstein's Equation for the Photoelectric Effect 

In his theory of the emission of radiation by a heated body, Planck
postulated that any change in the energy of a vibrating molecule
within the body must take place in integral multiples of hν when
radiation of frequency ν is emitted; however, he did not assume that
the energy in the radiation itself was quantised. Possibly he was
reluctant to do this because of the powerful hold of the
electromagnetic theory of radiation.

It was Einstein who, in 1905, put forward the idea that the radiation
itself was in the form of quanta, later called photons. He used
this hypothesis to explain the emission of electrons from the surface
of a conducting material when illuminated by monochromatic
radiation of frequency ν. The energy hν of a photon in this radiation
must exceed φe, the work function energy of the material, to cause
electrons to be ejected. The excess of hν over φe will give the ejected
electrons their maximum kinetic energy ½mₑvₘ², where vₘ is their
maximum velocity and mₑ is the mass of an electron.

The appropriate equation for the photoelectric emissive effect is
therefore

$$h\nu - \phi e = \tfrac{1}{2} m_e v_m^{2} \qquad (2.3)$$

This result was amply verified by the experiments of Millikan and 
others on the photoelectric effect. Equation 2.3, deduced by the 
use of a quantum theory of radiation, gives correct results which 
cannot be explained by the electromagnetic theory of radiation. 
Two main considerations in this connection are: 

1. The maximum energy with which electrons are emitted from 
a given material depends only on the frequency v of the inci- 
dent radiation; it does not depend on the illumination of the 
surface. This is in accordance with quantum theory. 

On a wave theory basis, on the other hand, it would certainly 
be expected that the amount of energy concentrated on one 
electron in the surface would be dependent on the illumination. 
For example, for a lamp which approximates to a point source, 
halving the lamp-to-surface separation would increase this 
energy calculated on a wave basis by four times. But this change 
of illumination has no influence on the energy of the ejected 
electrons. The number of electrons emitted per second will, 
however, increase in direct proportion to the illumination. 

2. As an example, suppose a collimated beam of light of wavelength
λ = 5,000 Å falls normally on a photocathode surface
of work function energy 1 eV and that the beam conveys to
this surface an energy of 10⁻⁹ J m⁻² s⁻¹. Experiment shows
that electrons are emitted as soon as the photocathode receives
the light, the time lag being less than a few nanoseconds. This is
explicable on the basis that the light is in the form of photons;
as soon as any one photon of energy

$$h\nu = \frac{hc}{\lambda} = \frac{6.625\times10^{-34}\times 3\times10^{18}}{5{,}000} \approx 4\times10^{-19}\ \mathrm{J}$$

(with c expressed in Å s⁻¹) reaches the surface, it makes available
energy of 4 × 10⁻¹⁹/(1.6 × 10⁻¹⁹) = 2.5 eV. Since this exceeds the
work function energy of 1 eV, electrons with a maximum energy of
1.5 eV are emitted.
Considering the light as electromagnetic waves, the area of the
photocathode which is effectively illuminated by any one wave
is λ² = (5 × 10⁻⁷)² m². The rate at which energy is expended
over this area by the beam of light is 10⁻⁹ × 25 × 10⁻¹⁴ J s⁻¹.
To provide the necessary work function energy of 1 eV =
1.6 × 10⁻¹⁹ J would consequently require a time given by

$$\frac{1.6\times10^{-19}}{25\times10^{-23}} = 6.4\times10^{2}\ \mathrm{s}$$

This implies that an electron could not be emitted unless there
were a time lag of 10⅔ min, which is completely out of accordance
with the experimental facts.
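The arithmetic of this example can be repeated in a few lines. The sketch below uses the constants quoted in the text; the function names are hypothetical and chosen only for this illustration.

```python
H = 6.625e-34        # Planck's constant, J s
C = 3.0e8            # velocity of light, m/s
E_CHARGE = 1.6e-19   # electronic charge, C, so that 1 eV = 1.6e-19 J

def photon_energy_eV(wavelength_m):
    """Photon energy h*c/lambda expressed in electron-volts."""
    return H * C / wavelength_m / E_CHARGE

def max_kinetic_energy_eV(wavelength_m, work_function_eV):
    """Equation 2.3; returns None when the photon cannot eject an electron."""
    excess = photon_energy_eV(wavelength_m) - work_function_eV
    return excess if excess > 0 else None

def classical_time_lag_s(wavelength_m, work_function_eV, intensity_W_m2):
    """Time for a wave to deliver the work function energy to an area lambda^2."""
    power = intensity_W_m2 * wavelength_m**2   # J/s falling on the area lambda^2
    return work_function_eV * E_CHARGE / power

if __name__ == "__main__":
    lam = 5000e-10   # 5,000 angstrom in metres
    print("photon energy (eV):", round(photon_energy_eV(lam), 2))
    print("maximum kinetic energy (eV):", round(max_kinetic_energy_eV(lam, 1.0), 2))
    print("classical time lag (s):", round(classical_time_lag_s(lam, 1.0, 1e-9)))
```

The quantum calculation depends only on the wavelength and the work function, whereas the classical time lag is inversely proportional to the illumination, which is the point of the comparison.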

2.3 The Bohr Theory of the Spectrum of the Hydrogen Atom 

Atomic spectra are in the form of lines, molecular spectra have 
a band appearance, the spectrum of radiation from a heated solid 
body is a continuum. 

The simplest atom is that of hydrogen: the nucleus is a proton,
and outside the nucleus is a single electron. Hydrogen gas is
normally in the molecular state, the hydrogen molecule H₂ consisting
of two hydrogen atoms bonded together. If an electrical discharge
is maintained in hydrogen, the molecular bonds will be split up.
The emission spectrum from a hydrogen discharge is thus due
primarily to the atoms: a line spectrum is observed.

The work of Balmer in 1885 and subsequent investigations by
Rydberg in 1890, Lyman in 1906 and Paschen in 1908 established
the empirical equation

$$\bar{\nu} = R\left(\frac{1}{m^{2}} - \frac{1}{n^{2}}\right) \qquad (2.4)$$

where ν̄ is the wave number (the reciprocal of the wavelength),
m and n are integers with m < n, and R is a constant. This equation
gave the wave numbers of the lines seen in the emission spectrum
of hydrogen. In the ultra-violet region, studied by Lyman,
m = 1 and n = 2, 3, 4, ...; in the visible region, m = 2 and
n = 3, 4, 5, ...; in the infra-red region, studied by Paschen,
m = 3 and n = 4, 5, 6, ....

There was no fundamental explanation of this equation until the
work of Bohr in 1913. In particular, no clear idea had been presented
as to why the radiation should be at discrete wavelengths and
the occurrence of an equation involving a difference term of the
form (1/m²) − (1/n²) was a mystery.

In 1911, Rutherford and his co-workers had established the nu- 
clear model of the atom by their work on the large-angle scattering 
of alpha particles transmitted through a thin foil. This model, with 
a central positive nucleus of charge Ze, where Z is the atomic num- 
ber and e the electronic charge, surrounded by electrons in elliptical 
orbits decided by the electrostatic attraction between the nucleus 
and the electrons in accordance with the Coulomb inverse square 
law, was established for atoms in the thin metallic foils of the metals 
used in the scattering experiments. Assuming that this model also 
held for the hydrogen atom and for the simplest case where the ellip- 
tical orbit of the electron is a circle with the nucleus at the centre, 
Bohr extended the quantum ideas of Planck and Einstein to obtain 
a fundamental explanation of the Balmer equation. 

The electron is considered to rotate in a circular orbit around the
nucleus; in doing so, it is subject to an electrostatic force of
attraction towards the central nucleus which is balanced against its
mass times its radial acceleration away from the nucleus. The
appropriate equation is therefore

$$\frac{Ze \times e}{4\pi\varepsilon_0 r^{2}} = \frac{m_e v^{2}}{r} \qquad (2.5)$$

where mₑ is the mass of the electron, v is its linear velocity along the
tangent of the orbit of radius r, and ε₀ is the permittivity of free
space.

The electron is subject to radial acceleration. In accordance with 
electromagnetic theory, accelerated charge radiates energy. The 
rotating electron should therefore radiate energy continuously and 
spiral into the nucleus in so doing. The concept of the planetary 
model becomes untenable. Faced with this dilemma, Bohr asserted 
that the electron did not always radiate energy continuously, so the 
electromagnetic theory was inadequate in this context. It remained 
to formulate a quantum approach to this problem. To do this, Bohr 
introduced the postulate that 'stationary states' exist in the atom. 
In a stationary state, electrons in the atom can only rotate around 
the nucleus in certain allowed orbits obeying specified quantum con- 
ditions. An electron can rotate indefinitely in such an orbit without 
radiating energy. 




The quantum condition demanded in such a stationary state was 
shown to be that the action (linear momentum multiplied by dis- 
tance) of the electron in traversing once a circular orbit is an integral 
multiple of Planck's constant h. This condition is therefore ex- 
pressed mathematically as 

$$m_e v \times 2\pi r = nh \qquad (2.6)$$

The integer n became known subsequently as the principal quantum 
number. 

It remained to explain how the atom could radiate energy. Bohr
alleged that such radiation was the consequence of a transition of
an electron from one allowed orbit to another. For energy to be
emitted, this demands that the energy of the atom in the initial
stationary state exceeds that in the final stationary state. The
postulate needed can be worded in the following form: if an electron
is rotating in an allowed orbit for which the energy of the
corresponding stationary state of the atom is Eₙ, it can make a
quantum jump to another allowed orbit for which the energy of the
stationary state of the atom is Eₘ. If Eₙ > Eₘ, this transition will
result in the emission of a photon of energy hν, where ν is the
frequency of the radiation. Thus

$$E_n - E_m = h\nu \qquad (2.7)$$

Considering the mechanism further, the hydrogen atom is initially
in an unexcited ground state with its electron in an orbit at the
minimum allowed distance from the nucleus. Let the energy of the
atom in this ground state be E₀. Within a gaseous discharge or
consequent upon other means of excitation, the electrons can be
raised to allowed orbits of larger radii, for which the energies of the
stationary states of the atom are Eₙ. The atom can only normally
remain in such an excited state for a time less than 10⁻⁸ s. Under the
electrostatic attraction of the nucleus, the electron makes a transition
to another allowed orbit, whereby the energy of the atom is
reduced from Eₙ to Eₘ where Eₙ > Eₘ. The frequencies ν of the
emitted radiations are consequently decided by the relationship

$$\nu = \frac{E_n - E_m}{h} \qquad (2.8)$$

where a range of values of Eₙ corresponding to n = 2, 3, 4, ... exist,
and also of Eₘ corresponding to m = 1, 2, 3, ..., where E₀ is the
value of Eₘ for m = 1.

The normal unexcited atom in the energy state E₀ can only absorb
radiation at frequencies decided by the relationship

$$\nu = \frac{E_n - E_0}{h} \qquad (2.9)$$

The absorption spectrum is immediately seen to contain fewer lines
than the emission spectrum, because there is only one value of E₀
in Equation 2.9 but a series of values of Eₘ in Equation 2.8; this is
in accordance with experimental observation.

The energy of a hydrogen atom due to its electron is equal to the
sum of the potential energy E_p and the kinetic energy E_k. The latter
is ½mₑv², assuming that the electron does not move with relativistic
speeds. To specify the potential energy E_p, it is assumed that this is
zero when the electron is removed to outside the influence of the
nucleus, i.e. the atom is ionised, tantamount to the radius r becoming
infinite. Therefore the potential energy when the electron is at
a distance r from the nucleus is decided by the work done on the
electron in bringing it from infinity to the distance r. This will be
negative, because the electron is attracted to the nucleus. Therefore

$$E_p = \int_{\infty}^{r} \frac{Ze^{2}}{4\pi\varepsilon_0 r^{2}}\,dr = -\frac{Ze^{2}}{4\pi\varepsilon_0 r}$$

The energy Eₙ of a hydrogen atom with its electron at distance r is
therefore given by

$$E_n = E_k + E_p = \frac{1}{2} m_e v^{2} - \frac{Ze^{2}}{4\pi\varepsilon_0 r}$$

As

$$\frac{m_e v^{2}}{r} = \frac{Ze^{2}}{4\pi\varepsilon_0 r^{2}}$$

so

$$\frac{1}{2} m_e v^{2} = \frac{Ze^{2}}{8\pi\varepsilon_0 r}$$

Therefore

$$E_n = -\frac{1}{2} m_e v^{2} = -\frac{Ze^{2}}{8\pi\varepsilon_0 r} \qquad (2.10)$$

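Collecting together the equations relevant to the Bohr theory of
the hydrogen atom:

$$\frac{Ze^{2}}{4\pi\varepsilon_0 r^{2}} = \frac{m_e v^{2}}{r} \qquad (2.5)$$

$$2\pi r m_e v = nh \qquad (2.6)$$

$$E_n - E_m = h\nu \qquad (2.7)$$

$$E_n = -\frac{1}{2} m_e v^{2} = -\frac{Ze^{2}}{8\pi\varepsilon_0 r} \qquad (2.10)$$

If Equation 2.5, written in the form mₑv²r = Ze²/4πε₀, is divided by
Equation 2.6,

$$v = \frac{Ze^{2}}{2\varepsilon_0 nh}$$

Substituting this expression for v in Equation 2.10,

$$E_n = -\frac{1}{2} m_e \frac{Z^{2}e^{4}}{4\varepsilon_0^{2} n^{2} h^{2}} = -\frac{Z^{2} e^{4} m_e}{8\varepsilon_0^{2} n^{2} h^{2}} \qquad (2.11)$$

Hence, Equation 2.7 becomes

$$h\nu = \frac{Z^{2} e^{4} m_e}{8\varepsilon_0^{2} h^{2}}\left(\frac{1}{m^{2}} - \frac{1}{n^{2}}\right)$$

Therefore

$$\bar{\nu} = \frac{\nu}{c} = \frac{Z^{2} e^{4} m_e}{8\varepsilon_0^{2} c h^{3}}\left(\frac{1}{m^{2}} - \frac{1}{n^{2}}\right) \qquad (2.12)$$

This equation compares with the empirical Balmer equation
(Equation 2.4) where the constant R, known as the Rydberg constant,
is given by

$$R = \frac{e^{4} m_e}{8\varepsilon_0^{2} c h^{3}} \qquad (2.13)$$

for hydrogen, the atomic number Z being unity.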

Substituting e = 1.6 × 10⁻¹⁹ C, mₑ = 9.11 × 10⁻³¹ kg, ε₀ =
8.854 × 10⁻¹² F m⁻¹, c = 3 × 10⁸ m s⁻¹ and h = 6.625 × 10⁻³⁴ J s,

$$R = \frac{(1.6)^{4}\times10^{-76}\times 9.11\times10^{-31}}{8\times(8.854\times10^{-12})^{2}\times 3\times10^{8}\times 6.625^{3}\times10^{-102}} = 1.098\times10^{7}\ \mathrm{m^{-1}}$$



This value agrees to within 1 part in 1,000 with that found experi- 
mentally by spectroscopy. This major triumph firmly established the 
validity of the Bohr theory of the spectrum of the hydrogen atom 
and inaugurated a new quantum approach to the consideration of 
the radiation in the line spectra from excited atoms. 
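Equation 2.13 is easily evaluated by machine. The sketch below is illustrative only: the first call uses the rounded constants quoted in the text, the second uses more precise values that are assumptions of this example and do not appear in the text. Since e enters to the fourth power, the rounded inputs may shift the result in the third significant figure.

```python
def rydberg_constant(e, m_e, eps0, c, h):
    """Equation 2.13: R = e^4 m_e / (8 eps0^2 c h^3), in m^-1 for SI inputs."""
    return e**4 * m_e / (8 * eps0**2 * c * h**3)

# Rounded constants as quoted in the text
R_text = rydberg_constant(1.6e-19, 9.11e-31, 8.854e-12, 3.0e8, 6.625e-34)

# More precise constants (assumed for this illustration, not from the text)
R_precise = rydberg_constant(1.602177e-19, 9.109384e-31, 8.854188e-12,
                             2.997925e8, 6.626070e-34)

if __name__ == "__main__":
    print(f"R from rounded constants: {R_text:.4e} m^-1")
    print(f"R from precise constants: {R_precise:.5e} m^-1")
```

Comparing the two printed values shows how sensitive the result is to the number of figures carried in the electronic charge.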

An energy-level diagram for the hydrogen atom shows the transitions
between energy levels which give rise to the lines in the
spectrum (Fig. 2.1). In this diagram, fine structure of the lines is
neglected. It is drawn on the basis that the energy Eₙ is related to
the principal quantum number n by Equation 2.11, which, for
hydrogen (Z = 1) and with R given by Equation 2.13, can be written
in the form

$$E_n = -\frac{Rch}{n^{2}}$$



Fig. 2.1. Energy-level diagram for the hydrogen atom (energy scale in eV from the ground state n = 1 up to the ionisation region at 13.6 eV; the Lyman ultra-violet lines, the Balmer lines in the visible spectrum and the Paschen infra-red lines are marked)

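To excite the atom from the ground state, in which Eₙ = E₀ and
n = 1, to a stationary state, for which n is an integer exceeding one,
requires an excitation energy Eₑ to be given to the atom decided by

$$E_e = Rch\left(\frac{1}{1^{2}} - \frac{1}{n^{2}}\right)$$

Therefore

$$E_e = 1.098\times10^{7}\times 3\times10^{8}\times 6.625\times10^{-34}\left(\frac{1}{1^{2}} - \frac{1}{n^{2}}\right)\ \mathrm{J}$$

$$= \frac{2.182\times10^{-18}}{1.6\times10^{-19}}\left(\frac{1}{1^{2}} - \frac{1}{n^{2}}\right)\ \mathrm{eV}$$

$$= 13.67\left(\frac{1}{1^{2}} - \frac{1}{n^{2}}\right)\ \mathrm{eV} \qquad (2.14)$$

The energy required to excite the atom from the ground state
to the state where n = 2 is therefore

$$E_e = 13.67\left(1 - \tfrac{1}{4}\right) = 10.25\ \mathrm{eV}$$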



This is the first excitation potential of the hydrogen atom. It is easily
seen that the second excitation potential (n = 3) is 13.67 × 8/9 =
12.15 eV, and so on for n = 4, 5, 6, .... Substitution of n = ∞ in
Equation 2.14 corresponds to the removal of the electron to outside
the atom (i.e. the ionisation of the atom); thus the ionisation
energy is 13.67 eV, which agrees reasonably well with the
experimentally determined value of 13.584 eV.
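The successive excitation potentials follow directly from Equation 2.14. A minimal sketch is given below; the default coefficient of 13.67 eV is the figure used in the text, and the function name is hypothetical.

```python
def excitation_energy_eV(n, ionisation_eV=13.67):
    """Equation 2.14: energy needed to raise the atom from n = 1 to level n."""
    if n < 1:
        raise ValueError("n must be a positive integer (or infinity)")
    return ionisation_eV * (1.0 - 1.0 / n**2)

if __name__ == "__main__":
    for n in (2, 3, 4, 5, 6):
        print(f"n = {n}: {excitation_energy_eV(n):.2f} eV")
    # n -> infinity corresponds to ionisation
    print("ionisation:", excitation_energy_eV(float("inf")), "eV")
```

The values crowd together as n increases and converge on the ionisation energy, which is why the upper levels of Fig. 2.1 lie so close to the ionisation region.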

The wave number of the radiation in the line spectrum corresponding
to any transition is also readily calculated. For example, a transition
from n = 3 to n = 2 will result in radiation of wave number ν̄ given
by Equation 2.12 as

$$\bar{\nu} = R\left(\frac{1}{2^{2}} - \frac{1}{3^{2}}\right) = 1.098\times10^{7}\times 0.139\ \mathrm{m^{-1}} = 1.525\times10^{6}\ \mathrm{m^{-1}}$$

The corresponding wavelength = 1/ν̄ = 6.563 × 10⁻⁷ m = 6,563 Å
in the visible spectrum.

The energy-level diagram (Fig. 2.1) shows that the Balmer series
of lines in the visible spectrum, known as the Hα, Hβ, Hγ, ... lines,
are brought about by electron transitions terminating at n = 2, the
Lyman series in the ultra-violet are due to transitions terminating
at n = 1, whilst lines in the infra-red part of the spectrum are caused
by transitions terminating at n = 3 (Paschen series), n = 4 (Brackett
series) and n = 5 (Pfund series).
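The leading lines of each series can be tabulated from Equation 2.12. The sketch below takes R = 1.098 × 10⁷ m⁻¹ as evaluated in the text; the dictionary and function names are hypothetical. The final figures of the wavelengths depend on the value adopted for R, so they need not coincide exactly with measured wavelengths.

```python
R_H = 1.098e7   # Rydberg constant in m^-1, as evaluated in the text

# Lower level m on which each series terminates
SERIES = {"Lyman": 1, "Balmer": 2, "Paschen": 3, "Brackett": 4, "Pfund": 5}

def wave_number(m, n, R=R_H, Z=1):
    """Equation 2.12 for a transition from level n down to level m (n > m)."""
    if n <= m:
        raise ValueError("n must exceed m for an emission line")
    return R * Z**2 * (1.0 / m**2 - 1.0 / n**2)

def wavelength_angstrom(m, n, R=R_H, Z=1):
    """Wavelength in angstrom units (1 m = 1e10 angstrom)."""
    return 1e10 / wave_number(m, n, R, Z)

if __name__ == "__main__":
    for name, m in SERIES.items():
        lines = [round(wavelength_angstrom(m, n)) for n in range(m + 1, m + 4)]
        print(f"{name:9s} m = {m}: first lines at {lines} angstrom")
```

The output places the Lyman lines in the ultra-violet, the first Balmer lines in the visible region and the remaining series in the infra-red, in agreement with Fig. 2.1.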

The less convenient orbital picture of the hydrogen atom, with 
some of the electron transitions concerned, is shown in Fig. 2.2. 

2.4 The Further Development of the Theory of Spectra 

Bohr's theory of the spectrum of the hydrogen atom inaugurated a 
revolution in theoretical concepts in that it related the structure 
of the atom to the radiation it emits when excited and also to the 
radiation it absorbs. This theory forms, however, only the begin- 
ning of a study which has become very complex. This complexity 
is due to the necessity of accounting for the spectra of the atoms 
with several extra-nuclear electrons, and also to the advent of 
quantum mechanics which has demanded an extensive re-appraisal 
of fundamental ideas, even to the extent of rejecting the concept 
of electrons circuiting specific orbits. 

Today, the theoretical treatment is quantum-mechanical; the
notions about electrons confined to orbits about the nucleus need
to be substantially modified and indeed are, strictly speaking,
obsolete. But it is very demanding on mathematical knowledge
to study the subject from the purely quantum-mechanical standpoint;
moreover, many of the ideas and much of the terminology
still extant are based on the orbital approach; finally, the strict
mathematical treatment tends to obscure insight into the physical
processes involved.

Fig. 2.2. Circular orbits for the hydrogen atom in the ground state (n = 1)
and in excited states for which n = 2, 3, 4, 5 and 6

The approach here is therefore to deal with the theory of spectra 
from the orbital standpoint and to show the modifications needed 
in the light of the quantum-mechanical treatment. 

The topics arising in developing the theory, most of which are 
the concern of subsequent sections of this book, may be listed as 
follows : 

1. In the theory of the spectrum of the hydrogen atom given in 
Section 2.3, the motion of the nucleus has been ignored. 
Moreover, the element may consist of a mixture of isotopes, 
so that nuclei with the same charge but different masses are 
involved. 

2. An electron revolving about a nucleus under the action of an 
inverse square law of attraction will, in general, describe an 
elliptical orbit of which the circle is only a special case. 

3. The electron will itself spin about its own axis as it revolves 
in an elliptical orbit about a nucleus. 

4. The hydrogen atom has only one extra-nuclear electron.
More complex atoms, ranging from helium (atomic number
Z = 2) up to uranium (Z = 92) will have between two and
92 orbiting electrons. At first sight, this problem seems
impossibly formidable. For example, iron (Z = 26) has 26
extra-nuclear electrons. Fortunately, a 26-body problem is
not needed to explain its spectrum! For the energies involved
in the production of spectra in the ultra-violet, infra-red and
visible regions, only the outermost electrons and their motions
are concerned. The problem is reducible to the consideration
of spectra involving one, two or three electrons in the majority
of cases.

5. A single line in an emission spectrum is split into a small 
number of lines when the emitting atoms are within a strong 
magnetic field : a phenomenon known as the Zeeman effect. 
There is a corresponding phenomenon in an intense electric 
field known as the Stark effect. The Zeeman effect is of 
fundamental importance in considering inter-relationships 
between the structure of atoms, radiation and magnetic 
phenomena. 

6. Light scattered on its passage through a transparent medium 
exhibits components having frequencies different from those 
in the incident light. This phenomenon — which is inexplicable 
on the basis of the electromagnetic theory, but is rational in 
terms of the quantum theory — is known as the Raman effect. 

7. The emission and absorption spectra of molecules will be 
considerably different from atomic spectra because of the 
bonds between atoms in the molecule. Band spectra and not 
line spectra are concerned. 

8. The structure of the energy-levels for isolated atoms (in the 
gaseous or vapour state) will be substantially different from 
that involved for atoms in the solid state. 

9. The nucleus of an atom has spin, which will result in a hyper- 
fine structure of the spectral lines emitted. 



2.5 Influence of the Finite Mass of the Nucleus in the Bohr Theory 
of the Spectrum of the Hydrogen Atom 

The nucleus of the hydrogen atom is a proton of mass M_p = 1,836 mₑ,
where mₑ is the rest mass of the electron. The theory given in
Section 2.3 assumes that this nucleus is stationary, which is
tantamount to considering it to be infinitely massive compared with the
electron. In fact, the electron will not rotate about the nucleus
but about the common centre of mass of the nucleus and the
electron.

The centre of mass will be on the line joining the proton and the
electron. If r is the separation between the proton and the electron,
r' is the distance from the proton to the centre of mass and r''
the distance from the centre of mass to the electron, clearly
r = r' + r'' and

$$m_e r'' = M_p r' = M_p (r - r'')$$

Therefore

$$r'' = \frac{M_p r}{m_e + M_p}$$

and

$$r' = \frac{m_e r}{m_e + M_p}$$

The total moment of momentum of the electron and proton about
the centre of mass is

$$m_e (r'')^{2}\omega + M_p (r')^{2}\omega$$

where ω is the angular velocity of the electron (or proton) about
the centre of mass. Substituting for r'' and r' from the previous
equations, the total moment of momentum is

$$\frac{m_e M_p^{2} + M_p m_e^{2}}{(m_e + M_p)^{2}}\, r^{2}\omega = \frac{m_e M_p}{m_e + M_p}\, r^{2}\omega$$

This agrees with mₑvr = mₑωr² of Equation 2.6, provided that
mₑ is replaced by m'ₑ, where

$$m'_e = \frac{m_e M_p}{m_e + M_p} \qquad (2.15)$$

so that Equation 2.12 becomes

$$\bar{\nu} = \frac{Z^{2} e^{4} m'_e}{8\varepsilon_0^{2} c h^{3}}\left(\frac{1}{m^{2}} - \frac{1}{n^{2}}\right) \qquad (2.16)$$

The term m'ₑ is known as the 'reduced' mass of the electron.

Equation 2.12 would be correct if the mass of the nucleus were
infinitely large. Thus, R in Equation 2.13 is best written as R_∞,
the Rydberg constant for infinite mass. It is easily seen that the
relationship between R_H and R_∞ is then

$$R_H = R_\infty \frac{M_p}{m_e + M_p} \qquad (2.17)$$

where R_H is the Rydberg constant for hydrogen.
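The size of the correction in Equation 2.17 is readily estimated. In the sketch below the nuclear mass is expressed in units of the electron mass, as in the text (M_p = 1,836 mₑ); the value adopted for R_∞ is an assumption made only for illustration.

```python
def reduced_mass_factor(nuclear_mass_in_me):
    """M / (m_e + M), with the nuclear mass M given in units of the electron mass."""
    M = float(nuclear_mass_in_me)
    return M / (1.0 + M)

def rydberg_for_nucleus(R_inf, nuclear_mass_in_me):
    """Equation 2.17, written for a nucleus of arbitrary mass M."""
    return R_inf * reduced_mass_factor(nuclear_mass_in_me)

R_INF = 1.0974e7   # m^-1; assumed illustrative value of the constant for infinite mass
R_H = rydberg_for_nucleus(R_INF, 1836.0)

if __name__ == "__main__":
    print(f"reduced mass factor for hydrogen: {reduced_mass_factor(1836.0):.6f}")
    print(f"R_H = {R_H:.5e} m^-1, fractional reduction {1 - R_H / R_INF:.2e}")
```

The fractional reduction is 1/(1 + M_p/mₑ), or about 5 parts in 10,000 for hydrogen, which is small but well within the precision of spectroscopic measurement.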




2.6 The Spectra of Deuterium and of Ionised Atoms having a Single 
Orbital Electron 

Deuterium (²₁H or ²₁D), singly ionised helium (⁴₂He⁺), doubly ionised
lithium (⁷₃Li⁺⁺), triply ionised beryllium (⁹₄Be⁺⁺⁺), and so on,
will all have nuclei surrounded by a single electron. For example,
the ⁷₃Li atom has three orbital electrons and two of these are
removed on doubly ionising positively. Deuterium and these ions
would therefore be expected to have 'hydrogen-like' spectra. They
will have, but the wave numbers of the lines concerned will be
decided by an equation of the form

$$\bar{\nu} = R Z^{2}\left(\frac{1}{m^{2}} - \frac{1}{n^{2}}\right) = R_\infty \frac{M}{m_e + M}\, Z^{2}\left(\frac{1}{m^{2}} - \frac{1}{n^{2}}\right) \qquad (2.18)$$

where, for the atom or ion concerned, R is the Rydberg constant,
M is the mass of its nucleus and Z the atomic number. For example,
for singly ionised helium

$$\bar{\nu} = 4 R_{He^{+}}\left(\frac{1}{m^{2}} - \frac{1}{n^{2}}\right) = 4 R_\infty \frac{M_{He}}{m_e + M_{He}}\left(\frac{1}{m^{2}} - \frac{1}{n^{2}}\right) \qquad (2.19)$$

where R_He⁺ is the Rydberg constant for singly ionised helium,
M_He is the mass of the helium nucleus, and Z = 2. The wave
numbers of the lines in the spectrum of singly ionised helium will
be therefore slightly different from four times the wave numbers
of the lines in the hydrogen spectrum, the difference being due to
the factor M_He/(mₑ + M_He) where M_He is approximately equal to
7,300 mₑ.

The example of deuterium is of interest: its spectrum will differ 
slightly from that of hydrogen because M is two atomic mass 
units instead of approximately one. This difference led to the 
original discovery of deuterium (heavy hydrogen) by Urey, Brick- 
wedde and Murphy in 1932, who detected the difference in wave 
numbers by the use of a 21 ft concave grating spectrometer. 
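The magnitude of these isotope and ion shifts can be estimated from the reduced-mass factor M/(mₑ + M). The sketch below is a rough illustration: the deuteron is simply taken as twice the proton mass and the value of R_∞ is assumed, neither figure being given in the text.

```python
M_H = 1836.0    # proton mass in units of m_e (Section 2.5)
M_D = 2 * M_H   # deuteron taken as two proton masses (rough assumption)
M_HE = 7300.0   # helium nucleus in units of m_e (value quoted in the text)
R_INF = 1.0974e7   # m^-1; assumed illustrative value

def mass_factor(M):
    """M / (m_e + M) with M in units of the electron mass."""
    return M / (1.0 + M)

def wave_number(Z, M, m, n, R_inf=R_INF):
    """Wave number of the n -> m line for a one-electron atom or ion."""
    return R_inf * mass_factor(M) * Z**2 * (1.0 / m**2 - 1.0 / n**2)

# Wavelength difference between the first Balmer line of hydrogen and of deuterium
shift_angstrom = 1e10 / wave_number(1, M_H, 2, 3) - 1e10 / wave_number(1, M_D, 2, 3)

# Ratio of a He+ wave number to four times the corresponding hydrogen wave number
helium_ratio = wave_number(2, M_HE, 2, 3) / (4 * wave_number(1, M_H, 2, 3))

if __name__ == "__main__":
    print(f"H-D shift of the first Balmer line: {shift_angstrom:.2f} angstrom")
    print(f"He+ wave number / (4 x hydrogen wave number): {helium_ratio:.6f}")
```

On these assumptions the deuterium line lies a little under 2 Å to the short-wavelength side of the hydrogen line, a separation which calls for an instrument of high resolving power such as the large grating spectrometer mentioned above.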

Spectrum lines are thus seen to exhibit fine structure (i.e. the 
fact that a single line as observed in a spectrometer of limited 
resolving power will be shown to have a multiplet structure on 
examination at higher resolution); this is brought about by the 
occurrence of isotopes whereby two or more kinds of atoms, 
having substantially the same electron configurations but nuclei 
of slightly different masses, are present in the emitter. It is stressed 
that there are several other causes of fine structure, some of more 
significance than the occurrence of isotopes. Nevertheless, spectro- 
scopic study of fine structure is important in methods of determin- 
ing isotopic constitution. 




2.7 Electrons in Elliptical Orbits 

Wilson in England and Sommerfeld in Germany, working inde- 
pendently in 1915 to 1916, developed Bohr's original quantum 
theory of the hydrogen atomic spectrum to take into account 
the fact that an electron rotating under a central attractive force 
to the nucleus, which varies inversely as the square of the electron- 
nucleus separation (ignoring the small correction due to motion 
of the nucleus), will follow an elliptical path which, in special 
circumstances only, is a circle. 

Fig. 2.3. Elliptical orbits for n = 3 (the three orbits about the nucleus are labelled n = 3, l = 2; n = 3, l = 1; and n = 3, l = 0)

They argued that each degree of freedom in the elliptical motion
is separately quantised. From this premise it follows that only
those specific elliptical orbits are allowable which satisfy the
criterion

$$\frac{b}{a} = \frac{l + 1}{n} \qquad (2.20)$$

where b is the semi-minor axis and a the semi-major axis of the
ellipse concerned, n is the principal quantum number and l the
orbital quantum number, also an integer, such that the angular
momentum of the electron about the nucleus is lh/2π.

The orbital quantum number l is a vector because it denotes
angular momentum, and is considered to be in the direction
perpendicular to the plane of the elliptical orbit. The possible values
of l are n − 1, n − 2, n − 3, ..., 0. If n = 3, for example, l can be
2, 1 or 0. Three orbits are therefore possible: they are shown in
Fig. 2.3.
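The orbits permitted by Equation 2.20 for any principal quantum number can be listed mechanically. A minimal sketch follows; the function name is hypothetical and exact fractions are used only so that the circular case b/a = 1 is recognised without rounding error.

```python
from fractions import Fraction

def allowed_orbits(n):
    """Return (l, b/a) pairs permitted by Equation 2.20 for principal quantum number n."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    # l runs through n - 1, n - 2, ..., 0
    return [(l, Fraction(l + 1, n)) for l in range(n - 1, -1, -1)]

if __name__ == "__main__":
    for l, ratio in allowed_orbits(3):
        shape = "circle" if ratio == 1 else "ellipse"
        print(f"n = 3, l = {l}: b/a = {ratio} ({shape})")
```

For n = 3 this gives the three orbits of Fig. 2.3, the orbit of greatest l being the circle and that of l = 0 the most eccentric ellipse.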

An empirical selection rule is found to hold whereby, in transitions
of electrons from one orbit to another, l can only change
by unity; thus Δl = ±1.






For a given value of a, the semi-major axis, a well-known result 
in non-relativistic dynamics is that, for a particle travelling in an 
elliptical orbit under an inverse square law of attraction to a fixed 
focus, the total energy is independent of the eccentricity of the 
ellipse. New energy levels of electrons in an atom due to their 
following elliptical orbits instead of only a circular one are thus 
apparently not introduced. Why, therefore, should there be any 
difference between transitions from or to any one of the three 
orbits illustrated in Fig. 2.3, for example? The answer is that a 
relativity correction is required. In its path in the ellipse, the 
electron will move faster in the immediate vicinity of the nucleus 
than it does further away. The mass changes in accordance with
the equation of the special theory of relativity:

$$m = m_0 (1 - \beta^{2})^{-1/2} \qquad (2.21)$$

where β = v/c, c being the velocity of light in free space, m₀ the
mass of the particle at rest and m its mass when travelling with a
velocity v.
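An order of magnitude for this mass change may be obtained by inserting in Equation 2.21 the speed of the electron in a circular Bohr orbit, v = Ze²/2ε₀nh, found in Section 2.3. The sketch below is only such an estimate, using the constants quoted in the text; it does not treat the varying speed round an ellipse.

```python
import math

E_CHARGE = 1.6e-19   # electronic charge, C
EPS0 = 8.854e-12     # permittivity of free space, F/m
H = 6.625e-34        # Planck's constant, J s
C = 3.0e8            # velocity of light, m/s

def orbital_speed(n, Z=1):
    """Speed in the n-th circular Bohr orbit: v = Z e^2 / (2 eps0 n h)."""
    return Z * E_CHARGE**2 / (2 * EPS0 * n * H)

def mass_ratio(v):
    """m / m_0 from Equation 2.21."""
    beta = v / C
    return 1.0 / math.sqrt(1.0 - beta**2)

if __name__ == "__main__":
    v1 = orbital_speed(1)
    print(f"speed in the ground-state orbit: {v1:.3e} m/s (beta = {v1 / C:.4f})")
    print(f"fractional increase of mass: {mass_ratio(v1) - 1:.2e}")
```

The fractional change of mass is only a few parts in 100,000, which indicates why the resulting energy differences appear as fine structure rather than as widely separated lines.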

The full, rather complicated, study of this relativistic correction 
leads to a picture of a precessing elliptical orbit whereby there are 
small energy changes with eccentricity. Consequently, fine structure 
of the spectrum lines results from the fact that n elliptical orbits 
are possible for a principal quantum number n, and these orbits 
have somewhat different energies. 

The mechanics of elliptical orbits, elegantly developed by Som- 
merfeld, Wilson and others, has tended to be thrown into the limbo 
of theoretical physics because of the advent of quantum mechanics. 
Nevertheless, the spectral notation, still in current use, has come 
from these ideas. The system of notation is given in Table 2.1. 
Placing the principal quantum number n before the appropriate
letter gives, for example, a 4p electron as one for which n = 4
and l = 1. A simple picture of its elliptical motion via Equation
2.20 is then immediately visualised.

Table 2.1.

THE SPECTRAL NOTATION OF ORBITING ELECTRONS IN ATOMS

Orbital angular momentum of the electron = lh/2π, where l is   0   1   2   3   4   5   6   7
State of the electron denoted by                               s   p   d   f   g   h   i   j
Corresponding state of the atom denoted by                     S   P   D   F   G   H   I   J

Note from Equation 2.20 that, when b = a, the ellipse is a
circle. Circular orbits thus correspond to l + 1 = n. It follows that
1s, 2p, 3d, 4f, 5g, ... electrons are all in circular orbits.



2.8 Electron Spin 

In several cases of the fine structure of spectra, the hypothesis of
elliptical orbits has proved not to furnish an adequate explanation.
In particular, the spectra of the alkali metals (for which the optical 
spectra involve transitions of only the single-valence electron) have 
lines which exhibit a doublet structure (the well-known example 
being the sodium D lines at wavelengths of 5,890 A and 5,896 A) 
which could not be explained. 

In 1925, Goudsmit and Uhlenbeck postulated that the electron 
rotated about its own axis at the same time as it was rotating in an 
elliptical orbit about the nucleus. The obvious analogy is with 
the spin of the Earth and the other planets about their axes as they 
revolve around the Sun, though it is not satisfactory to read too 
much into such an analogy or even the concept of physical spin 
of the electron itself. They further assumed that this spin had an
angular momentum of ½(h/2π) and was represented by a spin
quantum number s = ½.

This spin quantum number s was shown to be associated with
the orbital quantum number l in such a way that

$$j = l \pm \tfrac{1}{2}$$

where j is an additional quantum number, called the inner quantum
number, and s is either parallel or anti-parallel to l. An electron in a
2p state, for example, having l = 1, would therefore have j = 3/2
or 1/2. The full designation of this electron state is therefore 2p₁/₂
and 2p₃/₂, abbreviated to 2 ²p₁/₂.

Again, an empirical selection rule applies to allowed transitions
between orbits whereby j can only change by 0 or ±1, so Δj = 0
or ±1.
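The bookkeeping of the quantum numbers l and j, together with the two selection rules, can be sketched as follows. The function names are hypothetical; the letters follow Table 2.1, and the restriction of j to the single value 1/2 when l = 0 (since j cannot be negative) is a standard result added here rather than one stated in the text.

```python
from fractions import Fraction

LETTERS = "spdfghij"   # electron-state letters of Table 2.1 for l = 0 ... 7
HALF = Fraction(1, 2)

def j_values(l):
    """Inner quantum numbers j = l - 1/2 and l + 1/2 (only 1/2 when l = 0)."""
    return [HALF] if l == 0 else [l - HALF, l + HALF]

def designations(n):
    """All electron states for principal quantum number n, e.g. '2p3/2'."""
    if not 1 <= n <= len(LETTERS):
        raise ValueError("n is outside the range covered by Table 2.1")
    return [f"{n}{LETTERS[l]}{j}" for l in range(n) for j in j_values(l)]

def transition_allowed(l1, j1, l2, j2):
    """Selection rules: l changes by +1 or -1, and j changes by 0, +1 or -1."""
    return abs(l1 - l2) == 1 and abs(j1 - j2) in (0, 1)

if __name__ == "__main__":
    print(designations(3))
    print(transition_allowed(1, Fraction(3, 2), 0, HALF))   # a p(3/2) -> s(1/2) transition
```

Applied to a single valence electron, a transition from either of the two p states to an s state satisfies both rules, which is consistent with the doublet structure described for the alkali metals.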



2.9 The Magnetic Quantum Numbers; Space Quantisation 

The Zeeman effect has already been mentioned (Section 2.4) con- 
cerning the effect of a magnetic field on the spectra emitted by 
excited atoms. It is to be expected that the interactions between a 
magnetic field and an atom should also be quantised in some 
fashion. 

As an electron in motion inevitably produces a magnetic effect,
an electron in an elliptical orbit about a nucleus would be expected
to have a magnetic moment. The analogy is with the closed-current
loop and its equivalence to a magnetic dipole. The electric current I
equivalent to the charge e of an electron rotating f times per second
around the nucleus in an elliptical orbit (Fig. 2.4) is given by
I = ef, where I is in amperes if e is in coulombs. If the area of this
elliptical orbit is A, the magnetic moment M due to this current is
directed perpendicularly to the plane of the orbit and given by

$$M = \mu_0 I A \qquad (2.22)$$

where μ₀ is the magnetic permeability of free space.



Fig. 2.4. Magnetic moment of an electron in an elliptical orbit 
The area A is easily seen from Fig. 2.4 to be given by 

A = i j r* d6 (2.23) 

o 

The angular momentum $p$ of the electron of mass $m_e$ moving in an
elliptical orbit is constant (a fact which follows from Kepler's
laws in dynamics in the non-relativistic case, assumed here for
simplicity). Therefore,

$$p = m_e r^2\,\frac{d\theta}{dt} = \text{constant} \qquad (2.24)$$



Equation 2.23 can therefore be written as

$$A = \frac{1}{2}\int_0^{T} \frac{p}{m_e}\,dt = \frac{pT}{2m_e}$$

where $T = 1/f$ is the time taken by the electron to complete one
circuit in the orbit.



Equation 2.22 therefore becomes

$$M = \mu_0\,e f\,\frac{pT}{2m_e} = \frac{\mu_0 e p}{2m_e}$$

But the angular momentum $p = lh/2\pi$, so

$$M = l\,\frac{\mu_0 h e}{4\pi m_e} \qquad (2.26)$$

The factor $\mu_0 h e/4\pi m_e$ depends only on fundamental constants: it
is regarded as a fundamental unit of magnetic moment $\beta$, known
as the Bohr magneton. The value of $\beta$ is found by putting
$\mu_0 = 4\pi\times10^{-7}$ H m$^{-1}$, $h = 6.62\times10^{-34}$ J s, and
$e/m_e = 1.759\times10^{11}$ C kg$^{-1}$. Therefore

$$\beta = \frac{4\pi\times10^{-7}\times6.62\times10^{-34}\times1.759\times10^{11}}{4\pi} = 1.165\times10^{-29}\ \text{Wb m}$$
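The arithmetic above can be checked with a short sketch. It uses only the rounded constants quoted in the text; the function names `bohr_magneton` and `orbital_moment` are illustrative choices, not notation from the book.

```python
import math

# Rounded constants as quoted in the text (SI units).
MU_0 = 4 * math.pi * 1e-7   # permeability of free space, H/m
H = 6.62e-34                # Planck's constant, J s
E_OVER_M = 1.759e11         # specific charge of the electron, C/kg


def bohr_magneton(mu_0=MU_0, h=H, e_over_m=E_OVER_M):
    """Return beta = mu_0 * h * e / (4 * pi * m_e), in Wb m."""
    return mu_0 * h * e_over_m / (4 * math.pi)


def orbital_moment(l, beta=None):
    """Return the orbital magnetic moment M = l * beta (Equation 2.26)."""
    if beta is None:
        beta = bohr_magneton()
    return l * beta


if __name__ == "__main__":
    print(f"beta = {bohr_magneton():.3e} Wb m")
    print(f"M for l = 2: {orbital_moment(2):.3e} Wb m")
```

Because the factors of $4\pi$ cancel, the result is simply $10^{-7}\times h\times e/m_e$.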

When an atom containing an electron rotating in such an elliptical
orbit is placed in an external magnetic field of flux density $B$,
it would be expected that the plane of the orbit, and so the
magnetic dipole of moment $l\beta$, would be oriented with respect
to the direction of the magnetic field lines. This indeed occurs but,
as shown by the Zeeman effect and by the experiments of Gerlach
and Stern on the passage of beams of atoms through magnetic
fields, it is found that the orientation of the vector $l$ with respect
to the field is subject to a remarkable phenomenon known as space
quantisation.

According to the principle of space quantisation, the directions
that the vector $l$ (perpendicular to the plane of the orbit and in the
direction of the magnetic moment due to the orbit) can take up
are only such that the projections of $l$ in the direction of the flux
density $B$ are integers. In Fig. 2.5, therefore, where $l = 3$, the
directions of $l$ with respect to the flux direction along $B$ must be
such that the projections of $l$ on the line $B$ have the integral values
0, 1, 2, 3, -1, -2, -3.

The projections of $l$ on the direction of the magnetic flux are
designated by $m_l$, called the orbital magnetic quantum number. There
are clearly $2l+1$ values of $m_l$, all of which are integers, of possible
values $\pm l, \pm(l-1), \pm(l-2), \ldots, 0$.

There is also a spin magnetic quantum number $m_s$, which is
associated with the magnetic moment due to the spin of the electron
about its own axis. A simple analytical treatment of this concept is
not possible. It must suffice to assert that the spin magnetic
moment is found to have a value of $\beta$, the Bohr magneton, and that a
wide variety of magnetic phenomena can be explained on this basis.







Fig. 2.5. Space quantisation and the orbital magnetic quantum numbers 

The corresponding spin magnetic quantum number $m_s$ can have
only two values, $+\tfrac{1}{2}$ or $-\tfrac{1}{2}$, depending on whether it is
parallel or anti-parallel to the external magnetic flux.



2.10 The Pauli Exclusion Principle and the Basis of the Periodic 
Table of the Elements 

The quantum numbers so far introduced are $n$, $l$, $s$, $j$, $m_l$ and $m_s$,
and the vector combination of $m_l$ and $m_s$ gives rise to a seventh
one, $m_j$. It is not necessary, however, to give all these seven
quantum numbers to specify precisely the state of an electron in an atom.
The principal quantum number $n$ and the orbital quantum number
$l$ are necessary, but only two of the remaining five are needed. Of
these five, those usually selected are $m_l$ and $m_s$, because, then, the
values of $s$, $j$ and $m_j$ are predetermined.

The four quantum numbers $n$, $l$, $m_l$ and $m_s$ are regarded as basic.
They form a necessary and sufficient set to specify fully the quantum 
state of an electron in an atom. 

In 1925, Pauli put forward his famous exclusion principle which 
may be stated in the form that each and every electron in an atom 
has a quantum state which corresponds to the specification of the four 
basic quantum numbers, that all of the electrons which make up the 
structure of a given atom are different in this respect, and that no two 
electrons can exist in the atom in the same quantum state (i.e. no two 
electrons can have the same set of quantum numbers $n$, $l$, $m_l$ and $m_s$).




For a given element of atomic number Z, there are Z electrons in 
the extra-nuclear structure. Electrons in a given atom having the 
same principal quantum number n are said to be in the same shell. 
For the K shell, n = 1 ; for the L shell, n = 2; for the M shell, 
n = 3 ; for the N shell, n = 4, and so on. 

Electrons within a given shell in an atom which have the same
orbital quantum number $l$ are said to be in the same sub-shell. There
are consequently sub-shells corresponding to $l = 0$ (designated by
letter s for the electron), $l = 1$ (letter p), $l = 2$ (letter d), $l = 3$
(letter f) and so on (see Table 2.1).

Within a given sub-shell for which the orbital quantum number
is $l$, there are $2l+1$ possible values of the orbital magnetic quantum
number $m_l$ (Section 2.9), and each of these may have two different
values of the spin magnetic quantum number $m_s$, which is either
$\tfrac{1}{2}$ or $-\tfrac{1}{2}$. In accordance with the exclusion principle, it
therefore follows that the maximum possible number of electrons in a
given sub-shell is $2(2l+1)$.

It further follows that the maximum possible number of electrons
in a shell for which the principal quantum number is $n$ is decided
by the summation of all possible values of $2(2l+1)$, where
$l = 0, 1, 2, \ldots, n-1$. This maximum number is consequently given by

$$\sum_{l=0}^{n-1} 2(2l+1) = 2\{1 + 3 + 5 + \ldots + [2(n-1)+1]\} = 2n^2$$
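The summation can be verified directly. The following sketch counts the states allowed by the exclusion principle; the function names are illustrative.

```python
def subshell_capacity(l):
    """Maximum number of electrons in a sub-shell: 2(2l + 1)."""
    return 2 * (2 * l + 1)


def shell_capacity(n):
    """Maximum number of electrons in a shell: sum over l = 0 .. n - 1."""
    return sum(subshell_capacity(l) for l in range(n))


if __name__ == "__main__":
    for n, name in enumerate("KLMNO", start=1):
        print(f"{name} shell (n = {n}): {shell_capacity(n)} electrons")
```

The totals agree with $2n^2$ for every $n$ tried, as the closed form requires.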

In addition to the principle whereby electrons go into different
quantum states with $2n^2$ in a shell for which the principal quantum
number is $n$ and $2(2l+1)$ in a sub-shell for which the orbital
quantum number is $l$, the electrons, in taking up 'positions' around a
nucleus, go into the most tightly bound state possible, so entering
the lowest possible energy levels. This means that the shell with
$n = 1$ will become filled first, then that with $n = 2$, and so on. The
maximum possible numbers of electrons in successive shells with
$n = 1$, 2, 3, 4 and 5 are therefore 2, 8, 18, 32 and 50. The first
thirteen elements from hydrogen ($Z = 1$) to aluminium ($Z = 13$) in
the periodic table are considered in Table 2.2.

In Table 2.2 in relation to atomic spectra, it is noteworthy that 
lithium and sodium each has a single electron outside a closed shell 
structure. They are consequently monovalent and exhibit one-elec- 
tron type optical spectra; only this outermost electron is con- 
cerned in transitions which give rise to spectra in the optical 
region. On the other hand, beryllium and magnesium are seen to 
be divalent and will give two-electron optical spectra, whilst boron 
and aluminium are trivalent and have three-electron spectra. 




Table 2.2.

ELECTRONS IN THE K, L AND M SHELLS OF THE FIRST THIRTEEN ELEMENTS

                     K shell (n = 1)   L shell (n = 2)    M shell (n = 3)
Element       Z      l = 0             l = 0    l = 1     l = 0    l = 1    l = 2
                     1s                2s       2p        3s       3p       3d

Hydrogen      1      1
Helium        2      2*
Lithium       3      2                 1
Beryllium     4      2                 2‡
Boron         5      2                 2        1
Carbon        6      2                 2        2
Nitrogen      7      2                 2        3
Oxygen        8      2                 2        4
Fluorine      9      2                 2        5
Neon         10      2                 2        6†
Sodium       11      2                 2        6         1
Magnesium    12      2                 2        6         2
Aluminium    13      2                 2        6         2‡       1

* Shell filled: an inert gas.

† Maximum number in sub-shell with l = 1, shell filled: an inert gas.

‡ Maximum number in sub-shell with l = 0.



2.11 Spectra of the Alkali Metals 

Table 2.2 shows that the alkali metals lithium and sodium each has
a single electron in the outermost sub-shell which surrounds a closed
shell structure. These elements are in group IA of the periodic
table; they exhibit one-electron spectra and are monovalent. The
further alkali metals in group IA are potassium ($Z = 19$),
rubidium ($Z = 37$) and caesium ($Z = 55$). The potassium atom has
filled K and L shells, having together 10 electrons, and the
remaining nine electrons are in the M shell, for which $n = 3$ and which
can therefore contain a total of 18 electrons. Of these nine electrons,
two are in the sub-shell 3s (for which $l = 0$) and six in the sub-shell
3p (for which $l = 1$). The final valence or optical electron would
hence be expected to be in the unfilled sub-shell 3d (for which $l = 2$).
In fact, it is not but is in the fourth or N shell in the 4s state. This
is because the principle of least energy operates, rather than the
quantum considerations, in deciding the state of the atom. Similarly,
it can be shown that the other alkali metals, rubidium and caesium,
have a single optical electron in the outermost shell: the O shell
($n = 5$) for rubidium, and the P shell ($n = 6$) for caesium.

The alkali metals are consequently monovalent with a single 
electron responsible for the transitions which give rise to the spectra 
in the visible or near-visible regions. In considering such spectra, 
the chief features are explicable on the basis that this single electron 
undergoes energy level changes outside a structure consisting of the 
nucleus (positive charge $Ze$) surrounded by a closed shell structure
containing the other $Z-1$ electrons.




Fig. 2.6. Energy-level diagram for the sodium atom, showing some of the
transitions in accordance with $\Delta j = 0$ or $\pm 1$

The energy-level diagram and some of the transitions involved in
the sodium atom are shown in Fig. 2.6. The well-known sodium
doublet at wavelengths of 5,890 Å and 5,896 Å is explained as due
to transitions of the atomic state represented by:

$3p_{3/2} - 3s_{1/2}$, for which $\Delta j = 1$, to give the 5,890 Å line;
$3p_{1/2} - 3s_{1/2}$, for which $\Delta j = 0$, to give the 5,896 Å line.
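To give a feel for how small the separation of the two 3p levels is, the sketch below converts the two wavelengths into photon energies using $E = hc/\lambda$. The value of $c$ (taken as $3\times10^8$ m s$^{-1}$) and the function name are assumptions made for this illustration; $h$ and $e$ are the rounded values used elsewhere in the chapter.

```python
H = 6.625e-34        # Planck's constant, J s
C = 3.0e8            # speed of light, m/s (rounded, assumed here)
E_CHARGE = 1.6e-19   # electronic charge, C (converts joules to eV)


def photon_energy_ev(wavelength_angstrom):
    """Photon energy in electron-volts for a wavelength given in angstroms."""
    wavelength_m = wavelength_angstrom * 1e-10
    return H * C / wavelength_m / E_CHARGE


if __name__ == "__main__":
    e_5890 = photon_energy_ev(5890.0)
    e_5896 = photon_energy_ev(5896.0)
    print(f"5,890 A line: {e_5890:.4f} eV")
    print(f"5,896 A line: {e_5896:.4f} eV")
    print(f"separation of 3p levels: {(e_5890 - e_5896) * 1e3:.2f} meV")
```

Both photons carry about 2.1 eV, while the two upper levels differ by only about two thousandths of an electron-volt.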




2.12 Spectra of Atoms Due to the Transitions of Two or Three 
Electrons 

The elements beryllium and magnesium each has two electrons in 
the outermost sub-shell (Table 2.2). These are group IIA elements 
in the periodic table, of which the further members are calcium, 
strontium, barium and radium; in the companion group IIB occur 
zinc, cadmium and mercury. In all of these elements, two electrons 
are active in producing the emission spectra of the excited atoms. 

Let one of these electrons be in an orbit for which the orbital
quantum number is $l_1$ and the spin quantum number $s_1$, and the
other have corresponding quantum numbers $l_2$ and $s_2$. In these
group II elements, the spins combine vectorially to give a resultant
spin $S$ decided by

$$\mathbf{S} = \mathbf{s}_1 + \mathbf{s}_2$$

and the orbital quantum numbers also combine vectorially but
separately to give a resultant orbital quantum number $L$ decided by
the vector addition

$$\mathbf{L} = \mathbf{l}_1 + \mathbf{l}_2$$

The spins $s_1$ and $s_2$ are both $\tfrac{1}{2}$, corresponding to spins of
$\tfrac{1}{2}(h/2\pi)$, but the spins may be either parallel or anti-parallel
to each other. Therefore the value of $S$ is either 0 or 1.

The total inner quantum number $J$ is decided by the vector addition

$$\mathbf{J} = \mathbf{L} + \mathbf{S}$$

In the unexcited state, the two optical electrons are in an s state
for which $l$ is zero. However, when excited, these electrons can be
raised to the p state, for example, for which $l$ is a finite number.
Suppose $l_1 = 1$ and $l_2 = 0$, then $L = 1$. Therefore, by vector
addition, since $\mathbf{J} = \mathbf{L} + \mathbf{S}$, $J = 1$ when $L = 1$ and
$S = 0$. Following the system of notation in Table 2.1, $L = 1$
corresponds to a P state of the atom. A singlet level designated by
$^1P_1$ is therefore involved.

The vector addition of $L = 1$ and $S = 1$, however, can take
place in three different ways (Fig. 2.7) to give $J = 2$, 1 or 0, so there
are three different levels because of slightly different energies of the
magnetic interactions between orbital and spin motions, depending
on their orientations. The three levels are now designated by $^3P_1$,
$^3P_2$ and $^3P_0$, the superscript 3 denoting the triplet, and the subscript
1, 2 or 0 denoting the $J$ value.
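The counting of levels in this kind of vector addition can be sketched as follows, taking the allowed values of $J$ to run in integral steps from $|L - S|$ to $L + S$. The helper names `allowed_j` and `term_symbols` are hypothetical, and the plain-text notation such as `3P1` stands for $^3P_1$.

```python
from fractions import Fraction

# Letters for L = 0, 1, 2, ... following the usual S, P, D, F notation.
LETTERS = "SPDFGH"


def allowed_j(L, S):
    """Return the values of J from |L - S| to L + S in unit steps."""
    L, S = Fraction(L), Fraction(S)
    j = abs(L - S)
    values = []
    while j <= L + S:
        values.append(j)
        j += 1
    return values


def term_symbols(L, S):
    """Return level labels such as '3P1' (multiplicity, letter, J value)."""
    multiplicity = int(2 * Fraction(S) + 1)
    letter = LETTERS[int(L)]
    return [f"{multiplicity}{letter}{j}" for j in allowed_j(L, S)]


if __name__ == "__main__":
    print(term_symbols(1, 0))               # singlet P level
    print(term_symbols(1, 1))               # triplet P levels
    print(term_symbols(2, Fraction(3, 2)))  # quadruplet D levels
```

Half-integral spins are passed as `Fraction` values so that the subscripts print as exact fractions.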

As a second example, suppose the values of $l_1$ and $l_2$ are both 1.
The total orbital quantum number $L$ is now the result of the vector
addition

$$\mathbf{L} = \mathbf{l}_1 + \mathbf{l}_2$$



Fig. 2.7. The vector addition of L and S, where L = 1 and S = 1, to give
J = 2, 1 or 0

which gives the results 0, 1 or 2 [Fig. 2.8(a)]. The corresponding
atomic states are S, P or D respectively. As before, $S = 0$ or 1.
Therefore for $L = 2$ and $S = 0$

$$J = L + S = 2$$

giving a singlet $^1D_2$. For $L = 2$ and $S = 1$, the value of $J$ is found
by the vector addition of $\mathbf{L} + \mathbf{S}$. From Fig. 2.8(b) this can be
seen to give the value of $J$ as 1, 2 or 3, i.e. there is a triplet $^3D_1$,
$^3D_2$ and $^3D_3$. Transitions between such levels obey the selection
rule $\Delta J = 0$ or $\pm 1$. The complex pattern of spectral lines obtained
can be interpreted in this way.

Typical elements in group III of the periodic table are boron and 
aluminium (Table 2.2). Now three electrons will be active in produc- 
ing the spectrum of the excited atoms. For these light elements and 
several of the heavier ones, simplification of the mechanism of the 
energy levels involved arises because of Russell-Saunders or LS 
coupling. This type of coupling is such that the resultant spin quan- 
tum number S is the vector sum of the individual electron spin 




Fig. 2.8. (a) The vector addition of l_1 and l_2, where l_1 = l_2 = 1, to give
L = 0, 1 or 2 (S, P or D states); (b) the vector addition of L and S, where
L = 2 and S = 1, to give J = 1, 2 or 3

quantum numbers $s_1$, $s_2$ and $s_3$, each $\tfrac{1}{2}$, which are either all
parallel so $S = \tfrac{3}{2}$, or with one anti-parallel so $S = \tfrac{1}{2}$.
Furthermore, the resultant orbital quantum number $L$ is separately the
vector sum of the individual quantum numbers $l_1$, $l_2$ and $l_3$, i.e.

$$\mathbf{L} = \mathbf{l}_1 + \mathbf{l}_2 + \mathbf{l}_3$$

Also

$$\mathbf{J} = \mathbf{L} + \mathbf{S}$$

is such that the resultant vector $J$ is half-integral when $S$ is
half-integral.

For example, for $L = 2$ (a D term) the values of $J$ are decided by
either the vector addition of $L = 2$ and $S = \tfrac{3}{2}$ (which must give a
result in the form of a half-integral, $\tfrac{7}{2}$, $\tfrac{5}{2}$, $\tfrac{3}{2}$ or
$\tfrac{1}{2}$, corresponding to quadruplet energy levels designated by
$^4D_{7/2}$, $^4D_{5/2}$, $^4D_{3/2}$ and $^4D_{1/2}$) or the vector addition of
$L = 2$ and $S = \tfrac{1}{2}$ (giving half-integrals in the form of doublet
energy levels denoted by $^2D_{3/2}$ and $^2D_{5/2}$).

Another mode of coupling in three-electron spectra, known as jj
coupling, occurs but it is rare. The orbital quantum numbers $l_1$, $l_2$
and $l_3$ for the three electrons each combine vectorially with their
individual quantum numbers $s_1$, $s_2$ and $s_3$ to give three inner
quantum numbers $j_1$, $j_2$ and $j_3$. The resultant inner quantum number
$J$ is then decided by the vector addition

$$\mathbf{J} = \mathbf{j}_1 + \mathbf{j}_2 + \mathbf{j}_3$$

The explanation of the mechanisms involved in the interpreta- 
tion of the spectra of the heavier trivalent elements is difficult; a 
type of coupling intermediate between Russell-Saunders (LS) and jj 
is involved. 



2.13 The Wave Nature of 'Particles' 

In 1924, Louis de Broglie extended quantum concepts to include 
aspects of the motion of particles. Before 1924, quantum theory had 
been concerned only with electromagnetic radiation, but de Broglie 
attributed to a particle of mass m travelling with a velocity v relative 
to the observer a wavelength $\lambda$ given by

$$\lambda = \frac{h}{mv} = \frac{h}{p} \qquad (2.27)$$

where p = mv is the momentum of the particle, and h is Planck's 
constant. 

This theoretical study by de Broglie was verified experimentally
for electrons in 1927 with the work on the diffraction of electrons by
crystals by Davisson and Germer and, soon afterwards, by G. P.
Thomson. Subsequent experimental studies by other investigators
established that this 'de Broglie' wavelength also applies to protons,
atoms, molecules, neutrons and other particles.

In the years immediately following 1924, the subject of wave
mechanics (quantum mechanics) was developed by Heisenberg,
Schrödinger, Born and others who established the basic concepts and
equations concerned in describing particle motions from the wave 
standpoint. These ideas revolutionised the original treatment of 
atomic spectra as initially given by Bohr and, in particular, enabled 
the intensities of spectral lines to be accounted for in relation to the 
probabilities of electron transitions, a topic about which the 'clas- 
sical' quantum theory was relatively silent. 

The electron has a mass of $m_e = 9\times10^{-31}$ kg. If it is accelerated
through a potential difference of $V$ volts, it acquires a velocity of
$v$ m s$^{-1}$ given by

$$\tfrac{1}{2} m_e v^2 = Ve$$

where $e$ is the electronic charge $= 1.6\times10^{-19}$ C. (The relativistic
correction is ignored here.) Therefore

$$v = \sqrt{\frac{2Ve}{m_e}}$$

and the de Broglie wavelength of the electron in motion is given by
substituting $h = 6.625\times10^{-34}$ J s into Equation 2.27 as

$$\lambda = \frac{h}{m_e v} = \frac{h}{\sqrt{2Vem_e}}$$

or

$$\lambda = \frac{6.625\times10^{-34}}{\sqrt{2V\times1.6\times10^{-19}\times9\times10^{-31}}}\ \text{m}$$

Therefore

$$\lambda = \frac{12.26}{\sqrt{V}}\times10^{-10}\ \text{m} = \frac{12.26}{\sqrt{V}}\ \text{Å} \qquad (2.28)$$



An electron accelerated from rest through $V = 100$ V, so acquiring
an energy of 100 eV, therefore has a wavelength of 1.226 Å.
This is of no account if it passes through, say, a spacing of 0.1 mm
between grid wires in a thermionic vacuum tube but it is clearly of
fundamental importance in describing interactions between
electrons and lattice spacings in a crystal of the order of a few Ångström
units and the motion of electrons in atoms of dimensions 1-5 Å.
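Equation 2.28 is easy to evaluate for other accelerating voltages. The sketch below is non-relativistic, like the text, and uses slightly more precise constants than the rounded ones quoted above (an assumption made here so that the coefficient 12.26 is reproduced; the rounded mass $9\times10^{-31}$ kg gives a coefficient nearer 12.3). The function name is illustrative.

```python
import math

H = 6.625e-34          # Planck's constant, J s (value used in the text)
M_E = 9.109e-31        # electron mass, kg (more precise than 9e-31)
E_CHARGE = 1.602e-19   # electronic charge, C


def electron_wavelength_angstrom(volts):
    """Non-relativistic de Broglie wavelength, in angstroms, of an
    electron accelerated from rest through `volts` volts."""
    if volts <= 0:
        raise ValueError("accelerating voltage must be positive")
    momentum = math.sqrt(2 * volts * E_CHARGE * M_E)   # kg m/s
    return H / momentum * 1e10


if __name__ == "__main__":
    for v in (1, 100, 10_000):
        print(f"V = {v:>6} V: lambda = {electron_wavelength_angstrom(v):.4f} A")
```

At 10,000 V the relativistic correction that the text ignores begins to matter at the level of about one per cent, so the last figure should be read as approximate.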

The concept of the de Broglie wavelength gives immediately
fundamental significance to the postulate of Bohr that, in the stationary
state, an atom does not radiate energy and that this state corresponds
to an electron in a circular orbit for which, from Equation 2.6,

$$2\pi r m_e v = nh$$

The assumption implicit in Equation 2.6 is satisfactory in that it led
to the triumph of the Bohr theory of the hydrogen atom, but
unsatisfactory in that there was no fundamental justification for it. Using
the de Broglie hypothesis, however, $\lambda = h/m_e v$ inserted in Equation
2.6 gives

$$2\pi r = n\lambda$$

where n is an integer. The allowed orbits are consequently those 
which will exactly accommodate the de Broglie waves associated 
with electrons. An allowed orbit is thus one in which a stationary 
(or normal mode of) vibration takes place. The analogy is with the 
vibrations possible in a stretched string fixed at both ends. 
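The standing-wave condition can be illustrated numerically for hydrogen. The sketch assumes the standard Bohr expressions for the orbit radius, $r_n = \varepsilon_0 h^2 n^2/\pi m_e Z e^2$, and orbital speed, $v_n = Ze^2/2\varepsilon_0 h n$, which are not derived in this section, and rounded values of the constants; the function names are hypothetical.

```python
import math

H = 6.625e-34          # Planck's constant, J s
M_E = 9.109e-31        # electron mass, kg
E_CHARGE = 1.602e-19   # electronic charge, C
EPS_0 = 8.854e-12      # permittivity of free space, F/m


def bohr_radius(n, z=1):
    """Radius of the n-th circular Bohr orbit (assumed standard formula)."""
    return EPS_0 * H**2 * n**2 / (math.pi * M_E * z * E_CHARGE**2)


def bohr_speed(n, z=1):
    """Electron speed in the n-th Bohr orbit (assumed standard formula)."""
    return z * E_CHARGE**2 / (2 * EPS_0 * H * n)


def wavelengths_per_orbit(n, z=1):
    """Number of de Broglie wavelengths fitting round the n-th orbit."""
    wavelength = H / (M_E * bohr_speed(n, z))
    return 2 * math.pi * bohr_radius(n, z) / wavelength


if __name__ == "__main__":
    for n in (1, 2, 3):
        print(f"n = {n}: circumference / lambda = {wavelengths_per_orbit(n):.6f}")
```

The ratio comes out as the integer $n$ whatever values are used for the constants, because they cancel algebraically.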

2.14 Schrödinger's Equation for a Single Particle

In 1926, Schrödinger formulated the fundamental differential
equation for the wave motion associated with a particle. In order to
simplify the theory involved, consider a plane wave propagated in
one direction along an x-axis with a wave velocity $u$, where the
displacement is $\psi$ at a time $t$. The differential equation representing
wave motion is then

$$\frac{\partial^2\psi}{\partial x^2} = \frac{1}{u^2}\,\frac{\partial^2\psi}{\partial t^2} \qquad (2.29)$$

The steady-state solution of this equation in the form of a simple
harmonic function of time is

$$\psi = \Psi \exp j\omega t \qquad (2.30)$$

where $\omega = 2\pi\nu$, $\nu$ is the frequency and $\Psi$ is the amplitude of the
wave motion. If Equation 2.30 is differentiated twice with respect to
time, it gives

$$\frac{\partial^2\psi}{\partial t^2} = -\omega^2\,\Psi \exp j\omega t = -\omega^2\psi$$

so Equation 2.29 becomes

$$\frac{\partial^2\psi}{\partial x^2} = -\frac{\omega^2\psi}{u^2} = -\frac{4\pi^2\nu^2\psi}{u^2} = -\frac{4\pi^2}{\lambda^2}\,\psi \qquad (2.31)$$

because $u = \nu\lambda$. As these waves are associated with particles,
Equation 2.27 must apply, so $h/mv$ can be substituted for $\lambda$, where
$m$ is the mass of the particle and $v$ is the velocity of the particle along
the x-axis (not the same as the wave velocity $u$). Equation 2.31
therefore becomes

$$\frac{\partial^2\psi}{\partial x^2} = -\frac{4\pi^2 m^2 v^2}{h^2}\,\psi \qquad (2.32)$$

It is convenient to express this differential equation in terms of the
total energy $E$ and the potential energy $V$ of the particle, where $V$ is
a function of $x$. Now

$$E = E_K + V$$

where $E_K$ is the kinetic energy of the particle, given in the
non-relativistic case by $E_K = \tfrac{1}{2}mv^2$. Therefore

$$mv = \sqrt{2(E - V)m} \qquad (2.33)$$

Substituting for $mv$ in Equation 2.32 from Equation 2.33 gives

$$\frac{\partial^2\psi}{\partial x^2} = -\frac{8\pi^2 (E - V)m}{h^2}\,\psi$$

Therefore

$$\frac{\partial^2\psi}{\partial x^2} + \frac{8\pi^2 m}{h^2}(E - V)\psi = 0$$

Recalling the assumption that $\psi = \Psi \exp j\omega t$, division throughout
by $\exp j\omega t$ gives

$$\frac{d^2\Psi}{dx^2} + \frac{8\pi^2 m}{h^2}(E - V)\Psi = 0 \qquad (2.34)$$

which is the steady-state form of Schrödinger's wave equation in
one dimension for a single particle; it is independent of time.
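As a simple check on Equation 2.34, consider a free electron ($V = 0$), for which $\Psi = \sin kx$ with $k^2 = 8\pi^2 mE/h^2$ should be a solution. The sketch below tests this with a central finite-difference estimate of the second derivative; the choice of trial function, the step size and the function names are assumptions made for illustration.

```python
import math

H = 6.625e-34          # Planck's constant, J s
M_E = 9.109e-31        # electron mass, kg
E_CHARGE = 1.602e-19   # electronic charge, C (converts eV to joules)


def wave_number(energy_j, potential_j=0.0, mass=M_E):
    """k such that k^2 = 8 pi^2 m (E - V) / h^2 for constant V < E."""
    return math.sqrt(8 * math.pi**2 * mass * (energy_j - potential_j)) / H


def relative_residual(x, energy_j, dx=1e-13):
    """Size of d2(Psi)/dx2 + k^2 Psi, relative to k^2, for Psi = sin(kx),
    with the second derivative estimated by central differences."""
    k = wave_number(energy_j)

    def psi(s):
        return math.sin(k * s)

    second_derivative = (psi(x + dx) - 2 * psi(x) + psi(x - dx)) / dx**2
    return abs(second_derivative + k**2 * psi(x)) / k**2


if __name__ == "__main__":
    energy = 100 * E_CHARGE                 # a 100 eV electron
    k = wave_number(energy)
    print(f"wavelength 2*pi/k = {2 * math.pi / k * 1e10:.4f} A")
    print(f"relative residual = {relative_residual(0.3e-10, energy):.2e}")
```

The wavelength $2\pi/k$ recovered for a 100 eV electron agrees with the de Broglie value of Section 2.13, and the residual is small and limited only by the finite step.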

This simple proof of Schrödinger's equation gives some insight
into the elements of wave mechanics. It should be stressed,
however, that the assumption implicit in Equation 2.30 is not strictly
justifiable. In fact, taking into account that $E = h\nu$, it can be shown
that $\psi$ must be described by a complex expression of the form

$$\psi = C \exp[\pm j(2\pi\nu t - kx)]$$

where it is customary to use the negative exponent. From this it can
be shown that the amplitude $\Psi$ is a complex quantity of the form
$a + jb$.




2.15 The Heisenberg Uncertainty Principle; The Probability 
Concept 

A consequence of the de Broglie assertion as to the wave nature of 
a particle is that the position of a particle in space cannot be exactly 
specified. The electron cannot therefore be considered as having 
a position which is precisely defined by, for example, spherical polar 
coordinates $(r, \theta, \phi)$ within an atom, because it is not possible to
consider the electron as a point but only as a small group or packet 
of waves. Within the hydrogen atom, for example, the wavelength 
of the electron in the state where the principal quantum number 
is unity (i.e. the ground state) is shown in Section 2.13 to be equal 
to the circumference of the postulated circular orbit. 

In 1927, Heisenberg enunciated his famous uncertainty principle. 
This may be expressed by the equation 

$$\Delta x\,\Delta p_x \simeq h \qquad (2.35)$$

when referring to a particle travelling along an x-axis with a
momentum $p_x$ at the position coordinate $x$. The term $\Delta x$ is the
uncertainty in the specification of $x$, $\Delta p_x$ the uncertainty in determining
$p_x$, and $h$ is Planck's constant. It follows that the distance $x$ of an
electron from an origin can only be specified as being between $x$
and $x + \Delta x$, and that $\Delta x$ can only be reduced at the expense of
increased lack of certainty in giving the linear momentum.
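An order-of-magnitude illustration may help. Taking Equation 2.35 as $\Delta p_x \approx h/\Delta x$, the sketch below estimates the momentum and velocity uncertainty of an electron confined to a region of atomic size. The constants are rounded, the function names are hypothetical, and the result is only a rough estimate.

```python
H = 6.625e-34     # Planck's constant, J s
M_E = 9.109e-31   # electron mass, kg


def momentum_uncertainty(delta_x):
    """Order-of-magnitude momentum uncertainty h / delta_x, in kg m/s."""
    if delta_x <= 0:
        raise ValueError("delta_x must be positive")
    return H / delta_x


def velocity_uncertainty(delta_x, mass=M_E):
    """Corresponding non-relativistic velocity uncertainty, in m/s."""
    return momentum_uncertainty(delta_x) / mass


if __name__ == "__main__":
    delta_x = 1e-10   # 1 angstrom, roughly the size of an atom
    print(f"delta p ~ {momentum_uncertainty(delta_x):.2e} kg m/s")
    print(f"delta v ~ {velocity_uncertainty(delta_x):.2e} m/s")
```

For an electron located to within 1 Å the velocity uncertainty is of the order of several million metres per second, which is why a definite orbit cannot be assigned to it.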

This principle sets a fundamental limit to the possibility of deter- 
mining, by any conceivable experiment or theoretical method, the 
position of a particle in space. There is consequently no certainty 
in specifying the position of an electron in an atom: the picture of 
elliptical electron orbits is useful but fundamentally without precise 
meaning. What model of the atom exists therefore in quantum 
mechanics? 

In dealing with wave motion, consider first the electromagnetic
theory of light. Suppose the amplitude of the electric field strength
in a uniform collimated beam of light at a point in space is $E_0$.
The intensity of the light is proportional to $E_0^2$. If this beam of light
were described from the particle point of view, the intensity at the
point in question would be given by the number $N$ of photons that
passed per second through unit area drawn around the point in the
plane perpendicular to the beam. As these two methods of
specifying the intensity must be equivalent, it follows that

$$E_0^2 = \text{constant} \times N \qquad (2.36)$$

where the constant of proportionality depends simply on the units
chosen.




In describing the motions of electrons and other particles by
quantum mechanical procedures, Schrödinger's equation is used.
Solution of this equation gives $\Psi$, the amplitude of the wave motion
associated with the particle at the coordinate $x$. In atomic structure,
only one or a few electrons are involved and not a large number of
them as with photons in a beam of light. The procedure, therefore,
is to extend the idea implicit in Equation 2.36 by introducing the
probability concept. For a small volume $\delta v$ around a point with given
coordinates, the probability $P$ of the particle existing within $\delta v$
is taken to be the product of $\delta v$ and the square of the wave
amplitude $\Psi$. Thus,

$$P = \text{constant} \times |\Psi|^2\,\delta v \qquad (2.37)$$

where $|\Psi|$ is the modulus of the complex amplitude $\Psi$ of the
wave function.



2.16 The Quantum Mechanical Theory of the Hydrogen Atom 

To use the Schrödinger time-independent equation in dealing with
the probable locations of electrons in the theory of atomic structure,
Equation 2.34 must be extended to three dimensions. It becomes

$$\nabla^2\Psi + \frac{8\pi^2 m_e}{h^2}(E - V)\Psi = 0 \qquad (2.38)$$

where $m_e$ is the mass of the electron and $\nabla^2$ is a differential operator
which is given in Cartesian coordinates by

$$\nabla^2 = \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}$$

and in spherical polar coordinates, which are obviously more
appropriate in relating probable electron positions to a point nucleus
(assumed to be stationary), by

$$\nabla^2\Psi = \frac{\partial^2\Psi}{\partial r^2} + \frac{2}{r}\frac{\partial\Psi}{\partial r} + \frac{1}{r^2}\frac{\partial^2\Psi}{\partial\theta^2} + \frac{\cot\theta}{r^2}\frac{\partial\Psi}{\partial\theta} + \frac{1}{r^2\sin^2\theta}\frac{\partial^2\Psi}{\partial\phi^2} \qquad (2.39)$$

where the spherical polar coordinates of the electron are $(r, \theta, \phi)$
relative to the nucleus at the origin.

The potential energy $V$ of the electron when at a distance $r$ from
the nucleus is $-Ze^2/4\pi\varepsilon_0 r$, as shown in Section 2.3. The equation
to be solved for $\Psi$ is therefore given by substituting this expression
for $V$ and Equation 2.39 for $\nabla^2\Psi$ into Equation 2.38. It is
convenient to put $\hbar = h/2\pi$. Thus,

$$\frac{\partial^2\Psi}{\partial r^2} + \frac{2}{r}\frac{\partial\Psi}{\partial r} + \frac{1}{r^2}\frac{\partial^2\Psi}{\partial\theta^2} + \frac{\cot\theta}{r^2}\frac{\partial\Psi}{\partial\theta} + \frac{1}{r^2\sin^2\theta}\frac{\partial^2\Psi}{\partial\phi^2} + \frac{2m_e}{\hbar^2}\left(E + \frac{Ze^2}{4\pi\varepsilon_0 r}\right)\Psi = 0 \qquad (2.40)$$

Solutions of this wave equation are sought subject to the conditions
that $\Psi$ must be everywhere finite, single-valued, a continuous
function of the coordinates $(r, \theta, \phi)$ and have continuous first derivatives.
These stipulations replace, in effect, the Bohr postulate of
stationary states in the older theory.

The full solution of Equation 2.40 is complicated. It must suffice
here to state that it can be conveniently expressed in the form of
three equations:

$$\frac{d^2 f(r)}{dr^2} + \frac{2}{r}\frac{df(r)}{dr} + \left[\frac{2m_e}{\hbar^2}\left(E + \frac{Ze^2}{4\pi\varepsilon_0 r}\right) - \frac{l(l+1)}{r^2}\right] f(r) = 0 \qquad (2.41)$$

where $f(r)$ is a function of $r$ only,

$$\frac{d^2 f_1(\theta)}{d\theta^2} + \cot\theta\,\frac{df_1(\theta)}{d\theta} + \left[l(l+1) - \frac{m_l^2}{\sin^2\theta}\right] f_1(\theta) = 0 \qquad (2.42)$$

where $f_1(\theta)$ is a function of $\theta$ only, and $m_l$ and $l$ are constants, and

$$\frac{d^2 f_2(\phi)}{d\phi^2} + m_l^2 f_2(\phi) = 0 \qquad (2.43)$$

where $f_2(\phi)$ is a function of $\phi$ only. Moreover,

$$\Psi = f(r)\,f_1(\theta)\,f_2(\phi)$$

The conditions stipulated further demand that $l$ and $m_l$ are integers
and that $m_l$ has the $2l+1$ values $l, l-1, l-2, \ldots, 0, -1, -2, \ldots, -l$.
Solution of Equation 2.41 with these provisos results in an
expression for the allowed energy values of the hydrogen atom which is
the same as that given by Bohr (Equation 2.11).

It can further be shown that $\sqrt{l(l+1)}$ is the magnitude of the
orbital quantum number $l$ introduced in the elliptical orbit theory
(Section 2.7) and that $m_l$ is the magnitude of the orbital magnetic
quantum number $m_l$ introduced in Section 2.9.

A result of quantum mechanics which demands modification of
previously expressed ideas is that the magnitudes of the quantum
number vectors $l$ and $s$ are not simply $l$ and $s$ but $\sqrt{l(l+1)}$ and
$\sqrt{s(s+1)}$. This requires reappraisal of the ideas of vector addition
of $l$ and $s$ to give the inner quantum number $j$. This vector addition
must be such that the magnitude of $j$ is not simply $j$ but such that,
the addition being vectorial,

$$\sqrt{j(j+1)} = \sqrt{l(l+1)} + \sqrt{s(s+1)}$$

(see Section 2.19).

It is an instructive simplification to consider only the spherically
symmetrical example of the hydrogen atom, and so assume that
$\Psi$ varies only with the radius vector $r$. Equation 2.41 alone is then
required because variations with $\theta$ and $\phi$ are ignored. Equation
2.41 can then be written as

$$\frac{d^2\Psi}{dr^2} + \frac{2}{r}\frac{d\Psi}{dr} + \frac{2m_e}{\hbar^2}\left(E + \frac{Ze^2}{4\pi\varepsilon_0 r}\right)\Psi = 0 \qquad (2.44)$$

The simplest solution of this equation is

$$\Psi = \exp(-ra) \qquad (2.45)$$

where $a$ is a constant, a solution which satisfies the requirement that
$\Psi$ tends to zero at large distances $r$ of the electron from the nucleus.
Substituting in Equation 2.44 from Equation 2.45 gives

$$a^2 \exp(-ra) - \frac{2a}{r}\exp(-ra) + \frac{2m_e}{\hbar^2}\left(E + \frac{Ze^2}{4\pi\varepsilon_0 r}\right)\exp(-ra) = 0$$

Dividing through by $\exp(-ra)$,

$$a^2 - \frac{2a}{r} + \frac{2m_e}{\hbar^2}\left(E + \frac{Ze^2}{4\pi\varepsilon_0 r}\right) = 0 \qquad (2.46)$$

As Equation 2.46 must be true for all values of $r$, it follows that
the sum of the terms which are independent of $r$ must be zero.
Therefore

$$a^2 + \frac{2m_e E}{\hbar^2} = 0 \qquad (2.47)$$

The coefficient of $1/r$ must also be zero. Therefore

$$-2a + \frac{2m_e Ze^2}{4\pi\varepsilon_0\hbar^2} = 0$$

or

$$a = \frac{m_e Ze^2}{4\pi\hbar^2\varepsilon_0} \qquad (2.48)$$


Substituting this value for $a$ in Equation 2.47 gives

$$E = -\frac{\hbar^2 a^2}{2m_e} = -\frac{\hbar^2}{2m_e}\times\frac{m_e^2 Z^2 e^4}{16\pi^2\hbar^4\varepsilon_0^2} = -\frac{m_e Z^2 e^4}{32\pi^2\hbar^2\varepsilon_0^2}$$

or, as $h = 2\pi\hbar$,

$$E = -\frac{m_e Z^2 e^4}{8\varepsilon_0^2 h^2} \qquad (2.49)$$

which is the same as the expression for the energy $E$ given by the
Bohr theory for the case where the principal quantum number
$n = 1$ (Equation 2.11).
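Equations 2.48 and 2.49 can be evaluated numerically for hydrogen ($Z = 1$). The sketch below uses rounded values of the constants (the value of $\varepsilon_0$ is not quoted in this section and is an assumption here), and the function names are illustrative.

```python
import math

H = 6.626e-34          # Planck's constant, J s
M_E = 9.109e-31        # electron mass, kg
E_CHARGE = 1.602e-19   # electronic charge, C
EPS_0 = 8.854e-12      # permittivity of free space, F/m (assumed value)
H_BAR = H / (2 * math.pi)


def decay_constant(z=1):
    """a = m_e Z e^2 / (4 pi hbar^2 eps_0), Equation 2.48, in 1/m."""
    return M_E * z * E_CHARGE**2 / (4 * math.pi * H_BAR**2 * EPS_0)


def ground_state_energy_ev(z=1):
    """E = -m_e Z^2 e^4 / (8 eps_0^2 h^2), Equation 2.49, in eV."""
    energy_j = -M_E * z**2 * E_CHARGE**4 / (8 * EPS_0**2 * H**2)
    return energy_j / E_CHARGE


if __name__ == "__main__":
    a = decay_constant()
    print(f"1/a = {1e10 / a:.4f} A")
    print(f"E   = {ground_state_energy_ev():.2f} eV")
    # Equation 2.47 links the two results: E = -hbar^2 a^2 / (2 m_e)
    print(f"check: {-H_BAR**2 * a**2 / (2 * M_E) / E_CHARGE:.2f} eV")
```

The energy comes out close to $-13.6$ eV, the familiar ionisation energy of hydrogen, and $1/a$ is about 0.53 Å.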

Applying the probability concept, as expressed by Equation 2.37,
to the hydrogen atom, it is seen that the probability of finding the
electron within the spherical shell having inner and outer radii of
$r$ and $r + \delta r$ is decided by $|\Psi|^2\,4\pi r^2\,\delta r$ because
$\delta v = 4\pi r^2\,\delta r$. Presuming that $|\Psi|^2$ has been evaluated for the
hydrogen atom, the probability $P_r$ of the electron being at a distance $r$
from the nucleus is therefore given by

$$P_r = \text{constant} \times |\Psi|^2 r^2$$

Plots of $P_r$ against $r$ are given for the hydrogen atom in Fig. 2.9,
for the principal quantum numbers $n = 1$, 2 and 3.



Fig. 2.9. Probability distributions for the electron in a hydrogen atom: (a) for
n = 1; (b) for n = 2; (c) for n = 3 (each panel is plotted against distance r in Å;
the Bohr radius is marked in panel (a))




The quantum-mechanical picture of the hydrogen atom (with 
extensions to more complicated atoms) is different from the Bohr 
or older quantum theory model. The electron is now considered to 
have a probability of existing at any distance from the nucleus, 
though its exact position cannot be specified because of the uncer- 
tainty principle. Indeed, as shown by the graphs of Fig. 2.9, the 
electron has a very small probability not only of existing very close 
to the nucleus but also at distances significantly greater than the 
Bohr orbital dimensions. This model is more satisfying than the 
curious Bohr one in which electrons can only exist in certain specific 
orbits with 'nothing' between these orbits. 

The most probable position of the electron within the electron
'cloud' about the nucleus is where $P_r$ is a maximum. Where $n = 1$
and the hydrogen atom is spherically symmetrical, putting
$\Psi = \exp(-ra)$ from Equation 2.45 gives

$$P_r = \text{constant} \times r^2 |\Psi|^2 = \text{constant} \times r^2 \exp(-2ra)$$

The maximum value of $P_r$ occurs at $r = r_1$, where

$$\frac{d}{dr}\left[r^2 \exp(-2ra)\right] = 0$$

i.e. where

$$-2a \exp(-2r_1 a)\,r_1^2 + 2r_1 \exp(-2r_1 a) = 0$$

Thus

$$r_1 = \frac{1}{a}$$

Substituting for $a$ from Equation 2.48,

$$r_1 = \frac{4\pi\hbar^2\varepsilon_0}{m_e Ze^2} = \frac{\varepsilon_0 h^2}{\pi m_e Ze^2}$$

which is the same as the Bohr radius for $n = 1$.
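The position of this maximum can also be found by a direct numerical search, which makes a useful check on the differentiation. The sketch below scans $r^2\exp(-2ra)$ on a uniform grid; the grid range and resolution, and the function names, are arbitrary choices made for illustration.

```python
import math


def radial_probability(r, a):
    """Unnormalised P_r = r^2 exp(-2 r a) for the n = 1 state."""
    return r**2 * math.exp(-2 * r * a)


def most_probable_radius(a, span=10.0, steps=100_000):
    """Grid search for the r in [0, span / a] that maximises P_r."""
    dr = span / (a * steps)
    best_index = max(range(steps + 1),
                     key=lambda i: radial_probability(i * dr, a))
    return best_index * dr


if __name__ == "__main__":
    a = 2.0   # arbitrary units; the maximum is expected at r = 1/a = 0.5
    print(f"numerical maximum at r = {most_probable_radius(a):.4f}")
    print(f"analytical value 1/a   = {1 / a:.4f}")
```

Working in arbitrary units keeps the check independent of the values adopted for the physical constants.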

2.17 The Zeeman Effect 

This effect is that spectral lines are split up into components when 
the emitting atoms are placed in a magnetic field. 

The effect may be observed in a college experiment by the use of a 
neon-filled Geissler tube placed in a strong magnetic field provided 
by an electromagnet. The longitudinal effect is observed in the light 
emanating from the discharge tube in the direction of the magnetic 
field lines. This requires that a hole be bored in one of the magnet 
pole-pieces [Fig. 2.10(a)]. The transverse effect is observed in the 






light passing perpendicularly to the magnetic field lines; this is 
more convenient in practice as it does not require a bored pole- 
piece. 

If the frequency of one of the lines in the spectrum is $\nu$ (zero
magnetic field) it will be split into components of frequencies $\nu_1$ and
$\nu_2$ when the magnetic field is applied. Observation of the longitudinal
effect is represented in Fig. 2.10(b): the lines of frequency $\nu_1$ and
$\nu_2$ (assuming there are only two) appear on either side of the
position of the initial line of frequency $\nu$, which is absent. The light in




NO MAGNETIC FLUX 



TO SPECTROMETER 
WITH FABRY- PEROT 
ETALON 



FLUX 
— ►— 



' DIRECTION 2 
DIRECTION 
PERPENDICULAR 
TO PLANE OF 
DIAGRAM 



HJ 




FLUX APPLIED ; 
TRANSVERSE EFFECT 



FLUX APPLIED; 
LONGITUDINAL EFFECT 



(a) 



(b) 



Fig. 2.10. Observation of the Zeeman effect: (a) experimental arrangement 
(direction 1 is longitudinal, and direction 2 is transverse) ; (b) appearance of 
spectrum lines (the light direction is perpendicular to the plane of the diagram) 

these components is circularly polarised, one being in the clockwise 
direction and the other anticlockwise. In the transverse effect, the 
components again appear at j>i and v 2 but their electric vectors are 
linearly polarised in the direction perpendicular to both the field 
direction and the light direction; furthermore, the original line of 
frequency v is now present. 
It can be shown (Section 2.18) that

$$\nu_1 = \nu + \frac{Be}{4\pi m_e} \tag{2.50}$$

and

$$\nu_2 = \nu - \frac{Be}{4\pi m_e} \tag{2.51}$$




where $B$ is the magnetic flux density in webers per square meter,
$e$ is the electronic charge in coulombs and $m_e$ the mass of the
electron in kilograms. If $B$ is measured and the change of frequency
$\Delta\nu = \nu_1 - \nu = \nu - \nu_2$ is determined, the specific
charge $e/m_e$ of the electron can be found.

Given that $e/m_e = 1.759 \times 10^{11}$ C kg$^{-1}$, suppose that a
magnetic flux density of 0.2 Wb m$^{-2}$ (2,000 gauss) is used. The
change of frequency $\Delta\nu$ is then

$$\Delta\nu = \frac{0.2}{4\pi} \times 1.759 \times 10^{11} = 2.8 \times 10^{9}\ \text{Hz}$$

The frequency $\nu$ will be at a corresponding wavelength $\lambda$ of
about 6,000 Å in the red of the spectrum, say. Therefore
$\nu = (3 \times 10^{18})/6{,}000 = 5 \times 10^{14}$ Hz. The change of
frequency brought about by a flux density of 0.2 Wb m$^{-2}$ is
therefore only $2.8 \times 10^{9}$ in $5 \times 10^{14}$, or 1 part in
$1.78 \times 10^{5}$. Observation of such a small change requires
a spectrometer of high chromatic resolving power; an interferometer
such as the Fabry-Perot instrument in conjunction with a
constant-deviation prism spectrometer is therefore demanded. As
an interference technique is necessary, it is essential to change
the circular-polarised light components in the longitudinal effect
into plane-polarised light by insertion of an appropriate
quarter-wave plate (Section 3.10).

The observation of the Zeeman effect obviously demands large 
magnetic flux densities ; the effect in the Earth's magnetic field will 
be negligibly small. 
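
The arithmetic above is easily repeated for other flux densities. The sketch below is illustrative only: the function names are invented here, and the constants are the rounded values quoted in the text (e/m_e = 1.759 x 10^11 C kg^-1 and a light velocity of 3 x 10^18 Å per second).

```python
import math

E_OVER_M = 1.759e11           # specific charge e/m_e in C/kg, as quoted in the text
C_ANGSTROM_PER_S = 3.0e18     # velocity of light in angstroms per second (rounded)

def zeeman_shift_hz(b_flux_density, e_over_m=E_OVER_M):
    """Frequency shift B e / (4 pi m_e) of Equations 2.50 and 2.51, B in Wb/m^2."""
    return b_flux_density * e_over_m / (4.0 * math.pi)

def resolving_power_needed(b_flux_density, wavelength_angstrom):
    """Ratio nu / delta_nu for a line of the given wavelength."""
    nu = C_ANGSTROM_PER_S / wavelength_angstrom
    return nu / zeeman_shift_hz(b_flux_density)

if __name__ == "__main__":
    print(f"shift at 0.2 Wb/m^2: {zeeman_shift_hz(0.2):.2e} Hz")
    print(f"1 part in {resolving_power_needed(0.2, 6000.0):.3g}")
```

Calling `zeeman_shift_hz` with a flux density of the order of the Earth's field (say 5 x 10^-5 Wb m^-2, a value assumed here for illustration) gives a shift several thousand times smaller, which is why the effect is then negligible.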



2.18 Theory of the Normal Zeeman Effect 

The Zeeman effect is often referred to in two connotations as the 
normal Zeeman effect and the anomalous Zeeman effect. This divi- 
sion is artificial, however: the distinction is that in the normal effect 
(which can be explained by classical electron theory) the spin of the 
electron is absent and only the electron orbital motion need be con- 
sidered, whereas in the anomalous effect (which cannot be explained 
without quantum theory) both spin and orbital motions play 
a part. 

The sophisticated theory of the Zeeman effect is a quantum- 
mechanical one. This is demanding mathematically and will not be 
attempted here. The theory given is based on the older quantum 
theory of electron orbits, but modifications are introduced as required
in the light of quantum mechanics. Despite the artificiality of
the distinction between the normal and anomalous effects, it is 
convenient to consider the normal effect first because it does not 
involve the introduction of electron spin. 






The orbital angular momentum of an electron making an elliptical
circuit about a nucleus in an atom is given by $l\hbar$, where $l$ is the
orbital quantum number and $\hbar = h/2\pi$ (Section 2.7). Associated
with this orbit is a magnetic moment of $l\beta$, where $\beta$ is the Bohr
magneton $= \mu_0 he/4\pi m_e$ (Section 2.9).

Suppose the elliptical orbit for the optical electron concerned is
in a plane such that the perpendicular vector $l$ of length $l\hbar$ (the
angular momentum) is OA at an angle $\theta$ to the lines of force of
a magnetic field of flux density $B$ (Fig. 2.11). The vector OC of
length $l\beta$ representing the magnetic dipole moment of the orbiting
electron will also be perpendicular to the plane of the orbit.

Fig. 2.11. Elliptical orbit of electron within a magnetic field (OA is not in general
in the plane of the diagram; OC is anti-parallel to OA)



By convention, the vector representing the angular momentum 
of a body is drawn in the direction of forward travel of a right- 
handed corkscrew turned in the direction of the rotation of the 
body. The direction of OA in Fig. 2.11 is thereby decided. The con- 
ventional direction of the electric current in a circuit is that of a 
positive charge; the electron in the orbit is negatively charged. The 
equivalent conventional current direction is therefore opposite to 
that of the electron. The vector OC of length $l\beta$ representing the
magnetic moment of the dipole equivalent to the circulating elec- 
tron is therefore anti-parallel to OA. 

If a dipole of magnetic moment $M$ is placed in a magnetic field
of strength $H$ at an angle $\theta$ to the field lines, it experiences a
couple of magnitude $MH \sin\theta$. Hence the magnetic dipole of moment
$l\beta$ will experience a couple $T$ of magnitude $l\beta H \sin\theta$.
The effect of this couple is to make the elliptical orbit precess about
the magnetic field direction, so the vector OA of length $l\hbar$ will do
likewise. Owing to this couple, acting perpendicular to OA, the angular
momentum will change in direction but not in magnitude.

In Fig. 2.12(a), the vector $l\hbar$ is along OA at a time $t$, where OA
is at an angle $\theta$ to OX, the direction of the magnetic flux of
density $B$. Suppose OA moves to OD, still at the angle $\theta$, in time
$\delta t$, where the small angle between AE and DE (both drawn
perpendicular to OX) is $\delta\phi$, and AE = DE = $l\hbar \sin\theta$.
In Fig. 2.12(b), therefore, AD represents the change of angular momentum
$\delta(l\hbar)$ in the time $\delta t$ and

$$\delta\phi = \frac{AD}{AE} = \frac{\delta(l\hbar)}{l\hbar \sin\theta} \tag{2.52}$$

Fig. 2.12. Precession of angular momentum vector $l\hbar$ about the direction of the
magnetic flux (AE and DE are perpendicular to OX; OD is behind OA and
$\angle DOE = \theta$)

Now $\delta(l\hbar)$ is due to the couple $T$ acting for the time
$\delta t$. Therefore

$$\delta(l\hbar) = T\,\delta t = l\beta H \sin\theta\,\delta t$$

so Equation 2.52 becomes

$$\frac{\delta\phi}{\delta t} = \frac{l\beta H \sin\theta}{l\hbar \sin\theta}$$

Hence, the angular velocity of precession of the elliptical orbit is
$\omega$, where

$$\omega = \frac{d\phi}{dt} = \frac{l\beta H \sin\theta}{l\hbar \sin\theta} = \frac{\beta H}{\hbar}$$

As $\beta = \mu_0 he/4\pi m_e$ and $\hbar = h/2\pi$,

$$\omega = \frac{\mu_0 H he/4\pi m_e}{h/2\pi} = \frac{\mu_0 He}{2m_e} = \frac{Be}{2m_e} \tag{2.53}$$




The change of angular velocity due to this precession of the
orbit is much smaller than the angular velocity $\omega_0$ of the electron
in the elliptical orbit around the direction of the magnetic flux
lines. If $I_B$ is the moment of inertia of the orbiting electron about
the direction of the magnetic flux of density $B$, the change in the
kinetic energy $\Delta E$ on applying the magnetic flux is given by

$$\Delta E = \tfrac{1}{2} I_B (\omega_0 + \omega)^2 - \tfrac{1}{2} I_B \omega_0^2$$

As $\omega$ is much less than $\omega_0$,

$$\Delta E = I_B \omega_0 \omega = I_B \omega_0 \frac{Be}{2m_e} \tag{2.54}$$

To consider the effect of this additional kinetic energy $\Delta E$ it
must be taken into account that the angle $\theta$ between the vector
$l\hbar$ and the magnetic flux direction can only assume certain values
in accordance with the principle of space quantisation (Section 2.9).
From Fig. 2.5, it is easily seen that the permitted values of the
angle $\theta$ are such that

$$\cos\theta = \frac{m_l}{l} \tag{2.55}$$

where $m_l$ is the orbital magnetic quantum number.

The orbital angular momentum is $l\hbar$, so its component parallel
to the magnetic flux lines is $l\hbar \cos\theta$, which is seen to be
$m_l\hbar$ from Equation 2.55. This component of the orbital angular
momentum is also given by $I_B \omega_0$; therefore

$$I_B \omega_0 = m_l \hbar$$

Substituting this value for $I_B \omega_0$ into Equation 2.54 gives

$$\Delta E = m_l \hbar \frac{Be}{2m_e} = \frac{m_l h}{2\pi} \times \frac{Be}{2m_e} \tag{2.56}$$

where $m_l$ has the range of values $l, l-1, l-2, \ldots, 0, -1, -2, \ldots, -l$.

Equation 2.56 will apply only to an atom within a magnetic
field of which a particular line in the spectrum is not affected by
electron spin. An example is the red line of cadmium at 6,438 Å,
for which the two optical electrons have spins which are
anti-parallel to one another so the resultant spin is zero. Consider,
then, a transition from a state where $L = 2$ to $L = 1$, where the
corresponding energy levels are $E_2$ and $E_1$, and $E_2 > E_1$. In the
usual spectrum with no external magnetic flux applied, a single
line of frequency $\nu$ is emitted by the excited cadmium where

$$\nu = \frac{E_2 - E_1}{h}$$

$h$ being Planck's constant. On application of the magnetic flux,
the energy level $E_2$ corresponding to $L = 2$ will be split into five
different levels corresponding to $2L + 1$ different values of $m_l$.
These energies $E_{2B}$ will be decided by Equation 2.56, so are given
by the equation

$$E_{2B} = E_2 + m_l \frac{he}{4\pi m_e} B$$

where $m_l = 2, 1, 0, -1, -2$, represented by the upper set of
energy levels in Fig. 2.13.

Fig. 2.13. Allowed transitions in the normal Zeeman effect: (a) no applied
magnetic flux; (b) magnetic flux of density B applied (upper level L = 2, S = 0;
lower level L = 1, S = 0; nine transitions but only three of different magnitudes)

In the magnetic flux, the energy level $E_1$, corresponding to
$L = 1$, will split into three different levels corresponding to $2L + 1$
values of $m_l$. Now $E_{1B}$ will be decided by the equation

$$E_{1B} = E_1 + m_l \frac{he}{4\pi m_e} B$$

where $m_l = 1, 0, -1$, represented by the lower set of energy levels
in Fig. 2.13.

Transitions in the emission spectrum with the magnetic field
applied take place from the upper set of five levels to the lower
set of three, in accordance with the selection principles that $m_l$
must change by 0 or $\pm 1$, whereas $L$ must change by $\pm 1$. These
emission spectrum lines of frequencies $\nu_B$ are therefore decided by

$$h\nu_B = E_{2B} - E_{1B} = E_2 - E_1 + \Delta m_l \frac{he}{4\pi m_e} B$$

where $\Delta m_l$, the allowed changes of $m_l$, are $\pm 1$ or 0.
For $\Delta m_l = 0$,

$$\nu_B = \frac{E_2 - E_1}{h} = \nu$$

the original frequency with $B = 0$. For $\Delta m_l = +1$,

$$\nu_B = \frac{E_2 - E_1}{h} + \frac{Be}{4\pi m_e}$$

For $\Delta m_l = -1$,

$$\nu_B = \frac{E_2 - E_1}{h} - \frac{Be}{4\pi m_e}$$

The change of frequency $\Delta\nu$ from the original frequency $\nu$ is
consequently given, in magnitude, by

$$\Delta\nu = |\nu_B - \nu| = \frac{Be}{4\pi m_e} \tag{2.57}$$

the displaced line lying above $\nu$ for $\Delta m_l = +1$ and below it
for $\Delta m_l = -1$.

In the energy-level diagram of Fig. 2.13, there are nine possible
transitions for $\Delta m_l = 0$ or $\pm 1$, but clearly only three
components of different frequencies are emitted.
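
The counting of transitions can be illustrated by enumerating every pair of sub-levels and applying the selection rule. This is a minimal sketch; the function name and the choice to express each shift in units of Be/(4 pi m_e) are assumptions made here for illustration.

```python
def normal_zeeman_lines(l_upper, l_lower):
    """Count allowed transitions between the split levels of two states.

    Returns a dict mapping the frequency shift, in units of B e / (4 pi m_e),
    to the number of transitions producing that shift. In the normal effect
    the shift in these units is simply the change of m_l.
    """
    lines = {}
    for m_upper in range(-l_upper, l_upper + 1):
        for m_lower in range(-l_lower, l_lower + 1):
            delta_m = m_upper - m_lower
            if delta_m in (-1, 0, 1):      # selection rule on m_l
                lines[delta_m] = lines.get(delta_m, 0) + 1
    return lines

if __name__ == "__main__":
    lines = normal_zeeman_lines(2, 1)      # the cadmium example, L = 2 to L = 1
    print(sum(lines.values()), "transitions,", len(lines), "distinct frequencies")
```

For the L = 2 to L = 1 case the nine allowed transitions fall into three groups of three, one group for each allowed change of m_l.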



2.19 The Theory of the Anomalous Zeeman Effect 

In the anomalous Zeeman effect, the electron spin now contributes
to the spectrum, so the inner quantum number $j$ is involved instead
of $l$ alone, where

$$j = l + s$$

According to the older quantum theory, the magnitudes of the
vectors $l$ and $s$ are integers and $\pm\tfrac{1}{2}$ respectively, and
the vector addition concerned is simply a parallel or anti-parallel
addition, so $j = l \pm \tfrac{1}{2}$. Subsequent developments in quantum
mechanics have shown that this is incorrect. Without entering into the
complexities of the quantum-mechanical theory it suffices to state that
the older quantum theory based on orbital considerations gives
substantially satisfactory results provided that the magnitude of
$l$ is taken to be $\sqrt{l(l+1)}$ and that of $s$ to be $\sqrt{s(s+1)}$
(Section 2.16). If, for example, $l = 2$ where $s = \tfrac{1}{2}$, then the
vectors $l$ and $s$ must be added in such a way that the length of vector
$l$ is taken to be $\sqrt{2(2+1)} = \sqrt{6}$; the length of the vector $s$
is taken to be $\sqrt{\tfrac{1}{2}(\tfrac{1}{2}+1)} = \tfrac{1}{2}\sqrt{3}$;
and the resultant vector has a length $\sqrt{j(j+1)}$ where
$j = 2\tfrac{1}{2}$, so is of length
$\sqrt{\tfrac{5}{2}(\tfrac{5}{2}+1)} = \tfrac{1}{2}\sqrt{35}$. This means that
the angle $\alpha$ between the vectors $l$ and $s$ is predetermined and is
seen from Fig. 2.14 to be given by

$$j(j+1) = l(l+1) + s(s+1) + 2\sqrt{l(l+1)\,s(s+1)}\,\cos\alpha$$

Therefore

$$\cos\alpha = \frac{j(j+1) - l(l+1) - s(s+1)}{2\sqrt{l(l+1)\,s(s+1)}} \tag{2.58}$$

Fig. 2.14. The quantised vector addition of l and s, in accordance with quantum
mechanics, for $l = 2$ and $s = \tfrac{1}{2}$



The magnetic moment $M_l$ due to the orbital motion of the
electron is given by $l\beta$, where $\beta$ is the Bohr magneton, and the
magnetic moment $M_s$ due to the electron spin is $\beta$ (Section 2.9).
The resultant magnetic moment $M$ will be given by the vector
addition

$$M = M_l + M_s$$

Consider an atom where $l = 1$ and $s = \tfrac{1}{2}$, and only one electron
is involved in producing the spectrum. In Fig. 2.15, vector OA is
of length $\sqrt{1(1+1)} = \sqrt{2}$ to represent $l$, whilst OB is of length
$\sqrt{\tfrac{1}{2}(\tfrac{1}{2}+1)} = \tfrac{1}{2}\sqrt{3}$ to represent $s$,
giving a resultant $j$ along OF, where $\angle BOA = \alpha$ is decided by
Equation 2.58 to be

$$\alpha = \cos^{-1}\frac{\tfrac{3}{2}(\tfrac{3}{2}+1) - 1(1+1) - \tfrac{1}{2}(\tfrac{1}{2}+1)}{2\sqrt{1(1+1)\,\tfrac{1}{2}(\tfrac{1}{2}+1)}} = \cos^{-1} 0.408 = 65^\circ 54'$$

Fig. 2.15. The vectors representing orbital and spin angular momentum and
magnetic moment for an atom with a single optical electron

The orbital magnetic moment $M_l$ will be $\beta$ and so will the spin
magnetic moment. These will be represented by vectors OD and
OE respectively; these vectors are in the directions OA and OB
respectively, but have a resultant $M$ which is along OG and clearly
not along the direction of OF, which represents $j$.

Fig. 2.15 represents the vectors $l$ and $s$ in a particular plane.
In a finite time, the electron orbit (and therefore these vectors)
will occupy all possible planes so, in effect, will rotate about the
vector $j$; they will rotate much more rapidly than the precession
about the magnetic flux direction. The average effect of the
components $M_l$ and $M_s$ in a direction perpendicular to $j$ is therefore
zero. So the average effect over a finite time of $M$ will be $M_j$,
the projection of $M$ on vector $j$, which is OH in Fig. 2.15. Within
a magnetic flux of density $B$, the energy associated with a magnetic
dipole $M_j$ at an angle $\theta$ to the direction of the magnetic flux is
given by $M_j B \cos\theta$. The values of $\theta$ permitted in accordance
with the principles of space quantisation are decided by the equation

$$\cos\theta = \frac{m_j}{j}$$

where $m_j$ is a magnetic quantum number having $2j + 1$ possible
values. The possible energy values $E$ associated with the effect of
the magnetic flux on the orbiting electron are hence decided by
$m_j M_j B / j$.

The magnetic moment $M_j$ is equal to the sum of the projections
of $M_l$ and $M_s$ resolved along the direction of the vector $j$. Therefore

$$M_j = M_l \cos(l, j) + M_s \cos(s, j)$$

where $(l, j)$ is the angle between vectors $l$ and $j$, and $(s, j)$ that
between $s$ and $j$. Therefore

$$M_j = \beta\,[\,l \cos(l, j) + 2s \cos(s, j)\,] \tag{2.59}$$

because $M_l = l\beta$ and $M_s = \beta = 2s\beta$ where $s = \tfrac{1}{2}$.
From Fig. 2.14, it is seen that

$$s(s+1) = j(j+1) + l(l+1) - 2\sqrt{j(j+1)\,l(l+1)}\,\cos(l, j)$$

Therefore

$$\cos(l, j) = \frac{j(j+1) + l(l+1) - s(s+1)}{2\sqrt{j(j+1)\,l(l+1)}}$$

and, likewise,

$$\cos(s, j) = \frac{j(j+1) + s(s+1) - l(l+1)}{2\sqrt{j(j+1)\,s(s+1)}}$$

Substituting these values for $\cos(l, j)$ and $\cos(s, j)$ in Equation
2.59, together with $\sqrt{l(l+1)}$ and $\sqrt{s(s+1)}$ for the magnitudes
of $l$ and $s$ respectively,

$$M_j = \beta\,\frac{j(j+1) + l(l+1) - s(s+1) + 2[\,j(j+1) + s(s+1) - l(l+1)\,]}{2\sqrt{j(j+1)}} = \beta\,\frac{3j(j+1) + s(s+1) - l(l+1)}{2\sqrt{j(j+1)}}$$

As the possible energies $E$ due to the magnetic flux are $m_j M_j B / j$,
putting the magnitude of $j$ as $\sqrt{j(j+1)}$ gives

$$E = \frac{m_j B}{\sqrt{j(j+1)}}\,\beta\,\frac{3j(j+1) + s(s+1) - l(l+1)}{2\sqrt{j(j+1)}}$$

Thus

$$E = m_j\, g\, \beta B \tag{2.60}$$

where

$$g = \frac{3j(j+1) + s(s+1) - l(l+1)}{2j(j+1)}$$

i.e. where

$$g = 1 + \frac{j(j+1) + s(s+1) - l(l+1)}{2j(j+1)} \tag{2.61}$$

The factor $g$ is called the Landé splitting factor.

There are $2j + 1$ possible values of $m_j$, seen from Equation 2.60
to correspond to energy levels separated by intervals of $g\beta B$.
For example, the spectrum of sodium contains a line of wavelength
5,896 Å owing to the transition $3P_{1/2} - 3S_{1/2}$ (Section 2.11). In
the $3P_{1/2}$ state of the atom, corresponding to $3p_{1/2}$ for the single
optical electron involved, $l = 1$, $s = -\tfrac{1}{2}$ (of magnitude
$\tfrac{1}{2}$) and $j = \tfrac{1}{2}$, so

$$g = 1 + \frac{\tfrac{1}{2}(\tfrac{1}{2}+1) + \tfrac{1}{2}(\tfrac{1}{2}+1) - 1(1+1)}{2 \times \tfrac{1}{2}(\tfrac{1}{2}+1)} = \frac{2}{3}$$

In the $3S_{1/2}$ state, which is $3s_{1/2}$ for the electron, $l = 0$,
$s = \tfrac{1}{2}$ and $j = \tfrac{1}{2}$, hence

$$g = 1 + \frac{\tfrac{1}{2}(\tfrac{1}{2}+1) + \tfrac{1}{2}(\tfrac{1}{2}+1) - 0}{2 \times \tfrac{1}{2}(\tfrac{1}{2}+1)} = 2$$

In each instance, $m_j = \pm\tfrac{1}{2}$; so $g m_j$ is $\pm\tfrac{1}{3}$ for
the $3P_{1/2}$ term and $\pm 1$ for the $3S_{1/2}$ term. As the change of
$m_j$ permitted is $\pm 1$ or 0, the four transitions possible between
energy levels give a quadruplet in place of the original single line
(Fig. 2.16).

Fig. 2.16. Transitions in the anomalous Zeeman effect resulting from the splitting
of energy levels $3P_{1/2}$ and $3S_{1/2}$ for the sodium atom: (a) zero magnetic flux
(single line); (b) magnetic flux applied (four lines for $\Delta m_j = \pm 1$ or 0)
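
The two values of g and the resulting quadruplet can be reproduced with exact fractions. The sketch below is illustrative: the function names are invented here, each state is passed as a tuple (j, l, s) using the magnitudes of the quantum numbers, and shifts are expressed in units of the product of the Bohr magneton and B.

```python
from fractions import Fraction

def lande_g(j, l, s):
    """Lande splitting factor of Equation 2.61."""
    j, l, s = Fraction(j), Fraction(l), Fraction(s)
    return 1 + (j * (j + 1) + s * (s + 1) - l * (l + 1)) / (2 * j * (j + 1))

def line_shifts(upper, lower):
    """Energy shifts of the allowed lines, in units of (Bohr magneton x B).

    upper and lower are (j, l, s) tuples. A transition is allowed when m_j
    changes by 0 or +/-1; its shift is g_upper m_upper - g_lower m_lower.
    """
    j_up, j_lo = Fraction(upper[0]), Fraction(lower[0])
    g_up, g_lo = lande_g(*upper), lande_g(*lower)
    shifts = []
    for a in range(int(2 * j_up) + 1):
        m_up = -j_up + a
        for b in range(int(2 * j_lo) + 1):
            m_lo = -j_lo + b
            if m_up - m_lo in (-1, 0, 1):
                shifts.append(g_up * m_up - g_lo * m_lo)
    return sorted(shifts)

if __name__ == "__main__":
    half = Fraction(1, 2)
    p_state = (half, 1, half)   # 3P(1/2): j = 1/2, l = 1, s = 1/2
    s_state = (half, 0, half)   # 3S(1/2): j = 1/2, l = 0, s = 1/2
    print(lande_g(*p_state), lande_g(*s_state))
    print(line_shifts(p_state, s_state))
```

Using `Fraction` rather than floating-point numbers keeps results such as 2/3 exact, so the four distinct shifts of the quadruplet can be compared directly.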



2.20 The Stark Effect 

Zeeman discovered the effect named after him in 1896. In 1913,
Stark observed that the lines in the visible spectrum of hydrogen
atoms were all split symmetrically into components when the
emitting source was subject to an electric field of about $10^5$ V cm$^{-1}$.
The explanation of this phenomenon again demands the quantum
theory of radiation, but is too complex to consider here.
It can be established that changes $\Delta\nu$ which occur in wave number
$\nu$ of an emitted line as a result of an electric field of strength
$E$ V cm$^{-1}$ are given by

$$\Delta\nu = 6.45 \times 10^{-5}\, E\, n\, (n_1 - n_2) + a k_l k_s$$

where $a$ is a constant, $k_l$ and $k_s$ are the projections of orbital and
spin angular momentum in the direction of the electric field (so
they compare with the magnetic quantum numbers $m_l$ and $m_s$),
$n$ is the principal quantum number and $n_1$ and $n_2$ are modified
values of $n$ which can be deduced from quantum theory.



2.21 Molecular Spectra 

In an introductory explanation of the mechanisms involved in the
production of band spectra by molecules, it is convenient to
consider a molecule containing only two atoms, bearing in mind
that the concepts enunciated below for diatomic molecules also
apply to polyatomic molecules.

When energy $E$ is imparted to a diatomic molecule, the magnitude
of $E$ may be such that it can excite quantised molecular
energy levels as a result of three main phenomena: rotation of the
molecule about an axis between the constituent atoms, with which
are associated rotational energies $E_{rot}$; vibration of the atoms
within the molecule about their mean positions along the line
connecting their nuclei, with which are associated vibrational energies
$E_{vib}$; transitions between orbits of the outermost electrons within
the structure of the constituent atoms, with which are associated
electronic energies $E_{elec}$. In general, $E_{elec} > E_{vib} > E_{rot}$.

If $E_{vib} > E > E_{rot}$, molecular rotational energies only will be
stimulated, giving pure rotational spectral bands. If
$E_{elec} > E > E_{vib}$, both vibrational and rotational energies will be
excited, giving rotational-vibrational spectral bands. If $E > E_{elec}$
all three kinds of energy levels are excited and the complete molecular
spectrum is produced.

Where $E_{vib} > E > E_{rot}$ (so rotational spectra only are produced),
the energy imparted to the molecule alters its angular velocity
of rotation. If $I$ is the moment of inertia of the molecule about its
axis of rotation, the condition representing quantisation of the
angular momentum is

$$I\omega = \frac{kh}{2\pi}$$

where $k$ is an integer, $h$ is Planck's constant and $\omega$ is the angular
velocity of the rotation. The kinetic energy of rotation is therefore
given by the equation

$$E_{rot} = \tfrac{1}{2} I\omega^2 = \frac{k^2 h^2}{8\pi^2 I}$$

In order to introduce the generally applicable correction resulting
from quantum mechanics, $k^2$ is replaced by $k(k+1)$ (see Section
2.9), so

$$E_{rot} = \frac{k(k+1)h^2}{8\pi^2 I}$$

In absorption spectra (the study of molecular spectra is usually
by absorption rather than emission), if it is assumed that the
energy of the photon of radiation absorbed increases $k$ by unity,

$$h\nu = \frac{h^2}{8\pi^2 I}\,[(k+1)(k+2) - k(k+1)]$$

where $\nu$ is the frequency of the radiation, so

$$\nu = \frac{h}{4\pi^2 I}(k+1) \tag{2.62}$$

a series of lines in the rotational band spectra being obtained for
$k = 0, 1, 2, \ldots$ The frequencies involved in rotational spectra
are in the far infra-red, where the corresponding wavelengths are
about $10^{-2}$ cm or $10^{6}$ Å. The moment of inertia $I$ of the molecule
can be obtained from a study of these far infra-red spectra.
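
As a sketch of how the moment of inertia might be extracted, note that the photon-energy expression above gives line frequencies h(k + 1)/(4 pi^2 I), so adjacent lines are separated by h/(4 pi^2 I). The code below assumes this spacing has been measured; the function names and the sample spacing of 6 x 10^11 Hz are hypothetical and chosen only for illustration.

```python
import math

H = 6.63e-34   # Planck's constant in J s (rounded value used in this chapter)

def rotational_line_frequency(k, inertia):
    """Frequency h (k + 1) / (4 pi^2 I) of the line for which k rises by unity."""
    return H * (k + 1) / (4.0 * math.pi ** 2 * inertia)

def inertia_from_spacing(delta_nu):
    """Moment of inertia (kg m^2) from the spacing of adjacent lines (Hz)."""
    return H / (4.0 * math.pi ** 2 * delta_nu)

if __name__ == "__main__":
    spacing = 6.0e11                         # hypothetical measured spacing in Hz
    inertia = inertia_from_spacing(spacing)
    print(f"I = {inertia:.2e} kg m^2")
    print([f"{rotational_line_frequency(k, inertia):.2e}" for k in range(4)])
```

The lines are equally spaced in frequency, so any pair of adjacent lines yields the same estimate of I within this simple rigid-rotator picture.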

Where $E_{elec} > E > E_{vib} > E_{rot}$, both the vibrational energies
and the rotational energies of the molecules are excited but not
the electronic.

The vibrational energies of a diatomic molecule are due to the
oscillatory motion of the constituent atoms along the line joining
their nuclei. These oscillatory motions are not simple harmonic
ones and are consequently not easily analysed. The values of $E_{vib}$
are found to be quantised in accordance with an expression of the
form

$$E_{vib} = (v + \tfrac{1}{2})h\nu - (v + \tfrac{1}{2})^2 hx\nu + (v + \tfrac{1}{2})^3 hy\nu$$

where $v$ is the vibrational quantum number, $h$ is Planck's constant,
$\nu$ is the frequency of the photon of radiation in the vibrational
spectrum, and $x$ and $y$ are empirically determined anharmonicity
constants.

For a particular frequency $\nu_1$ in this vibrational region, there
will be an associated fundamental band of frequencies because of
excitation of the molecular rotational energies (unless the material
is near to the absolute zero of temperature); Equation 2.62 is
therefore also involved to give

$$\nu = \nu_1 \pm \frac{h}{4\pi^2 I}(k+1) \tag{2.63}$$

where $\nu$ is a frequency in the rotational-vibrational band and has
a series of values for $k = 0, 1, 2, \ldots$ The rotational-vibrational
band spectra are in the near infra-red region.

Where the energy $E$ imparted to the molecule exceeds the electronic
energy $E_{elec}$, all three types of molecular energies are excited.
Most of the characteristics of the spectra of molecules may then
be explained on the assumption (not strictly true) that the total
energy of excitation $E_e$ is equal to the sum of the electronic,
vibrational and rotational contributions, so

$$E_e = E_{elec} + E_{vib} + E_{rot}$$

The transitions between the electronic energy levels $E_{elec}$ are
generally considerably more energetic than those between vibrational
levels and are correspondingly associated with the absorption or
emission of the most energetic photons of frequencies within the
ultra-violet and visible spectral regions, known as electronic spectra.
Interpretation of the electronic spectra is in the light of the
quantum numbers and allowed transitions for the constituent
atoms. Suppose that a diatomic molecule comprises one atom
having a resultant orbital quantum number $L_1$ and a resultant
spin quantum number $S_1$ (usually resulting from Russell-Saunders
coupling) and a second atom for which these quantum numbers
are $L_2$ and $S_2$. For the molecule, the resultant orbital quantum
number $L$ is given in magnitude by

$$L = L_1 + L_2 - N$$

where $N = 0, 1, 2, \ldots, 2L$, so that a series of $2L + 1$ values of $L$ are
obtained. The resultant spin quantum number $S$ is given in magnitude
by

$$S = S_1 + S_2 - M$$

where $M = 0, 1, 2, \ldots, 2S$, so there are also $2S + 1$ values of $S$.
Between the nuclei of the two atoms in the molecule there exists
a strong internuclear electric field. A consequence of this field is
that the resultant orbital and spin angular momentum vectors of
magnitudes $Lh/2\pi$ and $Sh/2\pi$ respectively are caused to precess
about the internuclear axis. The principle of space quantisation
holds in a manner analogous to that in the magnetic case (Section
2.9), in that a quantum number $\Lambda$ prevails which is the projection
of the resultant orbital quantum number $L$ on the internuclear
axis, where the possible magnitudes of $\Lambda$ are given by

$$\Lambda = L, L-1, L-2, \ldots, 0, -1, -2, \ldots, -L$$

so there are $2L + 1$ values in all.

To designate the electronic states of molecules, letters are used
in a manner similar to that for atoms (Table 2.1) except that the
Greek capital letters $\Sigma$, $\Pi$, $\Delta$ and $\Phi$ are used instead
of the equivalent capitals S, P, D and F. Thus, the electronic state for
which $\Lambda = 0$ is characterised by $\Sigma$; for $\Lambda = 1$, the letter
$\Pi$ is used; $\Lambda = 2$ is represented by $\Delta$; and $\Lambda = 3$ by
$\Phi$. Symbols of the type $^3\Delta$, for example, arise where the
superscript denotes the multiplicity $2S + 1$ of resultant spin quantum
numbers $S$.



2.22 The Raman Effect 

When light is incident upon a transparent medium and is scattered
on its passage through the medium two types of scattering may be
identified: Rayleigh scattering, in which the frequencies present
in the scattered light are the same as those in the incident light,
no frequency changes being experienced; and Raman scattering,
in which the frequencies present in the scattered light are different
from those in the incident light, and frequency changes do take
place.

In general, Rayleigh scattering predominates greatly in intensity.
For particles in suspension in a liquid or gas, the Rayleigh
scattering intensity is proportional to the square of the volume of
the particle and proportional to the fourth power of the frequency
(i.e. proportional to $1/\lambda^4$, where $\lambda$ is the wavelength). In the
Rayleigh effect, the photons in the incident light are elastically
scattered by the molecules in the medium.

Fig. 2.17. Observation of the Raman effect in a liquid

The Raman effect, discovered experimentally by Raman and 
Krishnan in 1928 in India, and independently by Landsberg and 
Mandelstamm in Russia, is a consequence of inelastic scattering of 
the incident light photons by the molecules of a medium. For a given 
medium, the Raman scattering per unit volume is usually about 
0-001 of the intensity of the Rayleigh scattering. There is usually 
no phase relationship between the scattered light and the incident 
light in Raman scattering. 






Observation of the Raman effect demands intense incident light
because the scattering intensity is low. Developments from a simple
arrangement due to R. W. Wood are often employed. The scattering
liquid is in a cylindrical glass tube, about 1 cm in diameter and
10 cm in length, which is strongly illuminated along its length by
light from a quartz mercury arc lamp (Fig. 2.17); filters are
interposed if it is required to isolate particular wavelengths. The light
scattered in the liquid which passes down the axis of the tube is
observed through a plane end-window by means of a spectrometer.
The other end of the tube is blackened to provide a dark background
to the scattered light observed.

Fig. 2.18. A Raman spectrum for carbon tetrachloride (heights of lines are
proportional to intensities; Stokes and anti-Stokes lines lie on either side of
the exciting line, plotted against the frequency shift $\Delta\nu$)

Whereas the Rayleigh scattering can be largely explained in
terms of electromagnetic theory, the Raman effect demands the
quantum theory. If the incident light photon of energy $h\nu$ impinges
upon a molecule of the scattering medium and the energy state of
this molecule changes from $E_1$ to $E_2$, the energy of the Raman
scattered photon is given by $h\nu - (E_2 - E_1)$. Thus
$h\nu' = h\nu - (E_2 - E_1)$, where $\nu'$ is the frequency of the scattered
light, provided that the incident photon excites a molecule of the
scattering medium from a lower energy level $E_1$ to the higher one
$E_2$, so the energy of the scattered photon is less than that of the
incident photon. This gives a Raman line of which the frequency is
less than $\nu$ by the amount $\Delta\nu = \nu - \nu' = (E_2 - E_1)/h$; such
lines of lower frequency in the Raman spectrum are called Stokes lines.

Alternatively, but usually with much less probability, the molecule
in an excited state $E_2$ moves to a lower state $E_1$ by giving energy
to the incident photon to form a more energetic scattered photon.
Now,

$$h\nu' = h\nu + (E_2 - E_1)$$

where $\nu'$ is the frequency of the higher frequency scattered radiation
in the anti-Stokes lines. The frequency change is again
$\Delta\nu = \nu' - \nu = (E_2 - E_1)/h$.

The Stokes and anti-Stokes lines thus appear in pairs which are
symmetrical about the initial exciting line in the Raman spectrum.
The anti-Stokes lines are weaker in intensity than the Stokes lines,
because the excited states of the molecules at energies such as $E_2$
are more sparse than the ground states at energies such as $E_1$.
A Raman spectrum for carbon tetrachloride is represented in
Fig. 2.18.
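
The symmetry of the Stokes and anti-Stokes pair about the exciting line can be illustrated numerically. This is a minimal sketch: the function name is invented here, the level spacing is supplied as a wave-number shift in cm^-1, and the sample values (an exciting line near 4,358 Å and a shift of 459 cm^-1) are assumed purely for illustration rather than taken from the text.

```python
import math

C_CM_PER_S = 3.0e10   # velocity of light in cm/s (rounded)

def raman_lines(exciting_wavelength_angstrom, shift_wavenumber_cm):
    """Frequencies (Hz) of the Stokes, exciting and anti-Stokes lines.

    The level spacing (E2 - E1)/h is given as a wave number in cm^-1, so the
    frequency change is c times that wave number.
    """
    nu = C_CM_PER_S / (exciting_wavelength_angstrom * 1.0e-8)   # 1 angstrom = 1e-8 cm
    delta_nu = C_CM_PER_S * shift_wavenumber_cm
    return {
        "stokes": nu - delta_nu,        # h nu' = h nu - (E2 - E1)
        "exciting": nu,
        "anti_stokes": nu + delta_nu,   # h nu' = h nu + (E2 - E1)
    }

if __name__ == "__main__":
    for name, freq in raman_lines(4358.0, 459.0).items():
        print(f"{name:12s} {freq:.4e} Hz")
```

The sketch gives positions only; the relative weakness of the anti-Stokes line depends on the populations of the two levels, which the code does not model.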

In the visible region, the Raman frequency changes Av owing to 
the excitation of vibrational energy changes in molecules have been 
useful in supplementing data about molecular vibrational energies 
obtained from near infra-red spectra (Section 2.21). 



2.23 The Stimulated Emission of Radiation 

So far, frequent use has been made of the quantised energy states 
in which atoms and molecules may exist, in particular in relation to 
their emission and absorption of radiation, but no reference has 
been made to the relative numbers of atoms or molecules in a given 
material which are in the various states. For a given mass of material
in thermal equilibrium with its surroundings at an absolute
temperature $T$, the appropriate equation is

$$N_e = g_e N_0 \exp\left(-\frac{E_e - E_0}{kT}\right) \tag{2.64}$$

where $N_e$ is the number of atoms (or molecules) in the excited state
$E_e$ (so $N_e$ is said to be the population of the state $E_e$), $E_0$ is
the ground state energy for which the population is $N_0$, $k$ is
Boltzmann's constant, and $g_e$ is the degeneracy of the energy level
$E_e$ (a factor required to take into account the fact that several
states may have the same energy associated with them and are
indistinguishable unless, say, a magnetic field is applied).

From Equation 2.64 it is apparent that the population of the
excited states is very much less than that of the ground state unless
the temperature $T$ is very high. Thus, since Boltzmann's constant
$k = (1/11{,}600)$ eV °K$^{-1}$, the value of $kT$ is $(T/11{,}600)$ eV.
Suppose $E_e - E_0 = 2$ eV, then $(E_e - E_0)/kT = 23{,}200/T$. Assuming
for simplicity that $g_e$ is unity,

$$\frac{N_e}{N_0} = \exp\left(-\frac{23{,}200}{T}\right) = 10^{-23{,}200/2.3T} = 10^{-10{,}000/T}\ \text{approx.}$$

At $T = 300$°K (27°C), therefore, $N_e/N_0 = 10^{-33}$ approx.; so an
exceedingly minute fraction of atoms or molecules will exist in a state
of excitation at an energy 2 eV above the ground state at room
temperature. Even at a temperature of 2,000°K, when $N_e/N_0 = 10^{-5}$
approx., only one in 100,000 atoms or molecules is in an excited
state at 2 eV.
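
The population ratio is simple to evaluate for other energies and temperatures. The following is a minimal sketch using the rounded value of Boltzmann's constant quoted above, k = (1/11,600) eV per degree; the function name and default degeneracy of unity are assumptions made for illustration.

```python
import math

K_EV_PER_DEG = 1.0 / 11600.0   # Boltzmann's constant in eV per degree K (rounded, as in the text)

def population_ratio(excitation_ev, temperature_k, degeneracy=1.0):
    """Ratio N_e / N_0 from Equation 2.64 for an excitation energy E_e - E_0 in eV."""
    return degeneracy * math.exp(-excitation_ev / (K_EV_PER_DEG * temperature_k))

if __name__ == "__main__":
    for temperature in (300.0, 2000.0):
        ratio = population_ratio(2.0, temperature)
        print(f"T = {temperature:6.0f} K: N_e/N_0 = 10^{math.log10(ratio):.1f}")
```

Printing the base-10 logarithm makes the comparison with the rounded powers of ten in the text straightforward.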

Therefore, in order to excite a significant number of the atoms or
molecules in a material, it is necessary to raise the temperature
excessively, or to provide excitation by the impact of electrons or
other particles, as in a gaseous discharge, or to make use of the
absorption of photons of the appropriate energy. The last of these
methods of excitation is our present concern.

When a photon is absorbed by an atom or molecule, if its energy
equals or exceeds $E_e - E_0$ for the material, the excited state $E_e$
will be produced. The photon of frequency $\nu$ will provide an amount of
energy $h\nu$, where h is Planck's constant $= 6.63 \times 10^{-34}$ J s
$= 4.15 \times 10^{-15}$ eV s. An excitation energy $E_e$ of 2 eV will
thus be created by the absorption of radiation of frequency given by

$$4.15 \times 10^{-15}\,\nu = 2$$

Therefore

$$\nu = \frac{2 \times 10^{15}}{4.15} = 4.8 \times 10^{14} \text{ Hz}$$

which corresponds to a wavelength of 6,250 Å in the red part of the
visible spectrum. The atom or molecule will remain in the excited
state of energy $E_e$ for a spontaneous emission time before it decays to
the ground state (or to some intermediate quantised energy level
between $E_0$ and $E_e$ in which it remains for a second spontaneous
emission time) by the emission of another photon of energy
$E_e - E_0$.
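
The arithmetic of this example can be reproduced with a short sketch. It
assumes the value of h in eV s quoted in the text and a rounded velocity
of light of $3 \times 10^8$ m s$^{-1}$; the function names are illustrative
only.

```python
H_EV_S = 4.15e-15    # Planck's constant in eV s, value used in the text
C_M_PER_S = 3.0e8    # velocity of light in free space (rounded), m/s

def photon_frequency_hz(energy_ev):
    """Frequency of a photon whose energy h*nu equals energy_ev."""
    return energy_ev / H_EV_S

def wavelength_angstrom(frequency_hz):
    """Free-space wavelength in angstrom units (1 A = 1e-10 m)."""
    return C_M_PER_S / frequency_hz * 1e10

nu = photon_frequency_hz(2.0)
print(f"frequency  = {nu:.2e} Hz")
print(f"wavelength = {wavelength_angstrom(nu):.0f} angstrom")
```

The wavelength comes out slightly below 6,250 Å because the text rounds
the frequency to $4.8 \times 10^{14}$ Hz before converting.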

It is of interest to determine the rate at which energy levels are 
excited by the absorption of photons, the rate at which spontaneous 
emission occurs, and also to consider whether it is possible to stim- 
ulate emission of photons from the excited state to the ground 
state in such a way that more photons are emitted than are ab- 
sorbed, so that a form of amplification of radiation is obtained. 

The rate of excitation of an energy level $E_e$ or, more specifically,
the time rate of increase of the population $N_e$ brought about
by the absorption of photons of energy $E_e - E_0$ and frequency
$\nu = (E_e - E_0)/h$, is taken to be proportional to the population
$N_0$ of the ground state and also proportional to the energy per unit
volume $U(\nu)$ in the radiation due to these photons. Therefore,

$$\frac{dN_e}{dt} = B\,U(\nu)\,N_0 \tag{2.65}$$

where B is the Einstein coefficient for absorption, named after
Einstein who introduced the concepts outlined in this section.

If the material is in thermal equilibrium with its surroundings at 
an absolute temperature T (with no radiation from an external 
source incident upon it, so that the photons available for the exci- 
tation are simply due to the radiation from the material), Planck's 
equation for the distribution of energy in the spectrum (Equation 
2.1) applies: 

$$U_T(\nu) = \frac{8\pi h\nu^3}{c^3[\exp(h\nu/kT) - 1]} \tag{2.66}$$

in which $U_T(\nu)\,d\nu$ is the energy density at temperature T in the
small frequency interval $\nu$ to $\nu + d\nu$, and c is the velocity of
light in free space.

The spontaneous emission of photons by the excited atoms or
molecules is proportional only to the population $N_e$ of the excited
state, so the rate of decay of excitation, or rate of decay of
population of the excited state, is decided by

$$\frac{dN_e}{dt} = -A\,N_e \tag{2.67}$$

where A is a second Einstein coefficient for spontaneous emission.

To cause stimulated emission of radiation, the material is irradi- 
ated from an external source with photons of the same energy as 
those which are spontaneously emitted. The object is to trigger the 
excited states by the incident photons in such a manner that the 
number of photons emitted on de-excitation exceeds the number 
absorbed. Stimulated emission can take place on such irradiation, 
but whether it results in amplification or not is an additional matter 
requiring investigation. 

In such stimulated emission, the time rate of decay of population
of the excited state $(dN_e/dt)$ is proportional to the energy density in
the incident radiation as well as to the population, so

$$\left(\frac{dN_e}{dt}\right)_s = -C\,U(\nu)\,N_e \tag{2.68}$$

where C is a constant and the suffix s denotes stimulated emission.



During stimulated emission, all three processes represented by
Equations 2.65, 2.67 and 2.68 occur, so

$$\frac{dN_e}{dt} = B\,U(\nu)\,N_0 - A\,N_e - C\,U(\nu)\,N_e \tag{2.69}$$

Whilst it is clear from Equation 2.66 that the possibility of
stimulated emission occurring is exceedingly remote and that
amplification is impossible in a material in thermal equilibrium with
its surroundings at a temperature T and not subject to external
radiation, it nevertheless leads to useful results to consider the
situation where $U(\nu)$ is simply $U_T(\nu)$ as decided by Equation 2.66.
When the gas or solid material is in such thermal equilibrium, the
principle of detailed balance requires that all transitions occur at
equal rates in both directions. Therefore,

$$\frac{dN_e}{dt} = 0 \tag{2.70}$$

From Equations 2.69 and 2.70, where $U(\nu) = U_T(\nu)$,

$$U_T(\nu) = \frac{A}{B(N_0/N_e) - C} \tag{2.71}$$

The factor $N_0/N_e$ is given by Equation 2.64, where $E_e - E_0 = h\nu$
and assuming $g_e = 1$, to be

$$\frac{N_0}{N_e} = \exp\left(\frac{h\nu}{kT}\right)$$

Substitution in Equation 2.71 gives

$$U_T(\nu) = \frac{A}{B\exp(h\nu/kT) - C} \tag{2.72}$$

As Equations 2.66 and 2.72 must be equivalent, it follows that
B = C, which is generally true, and that for thermal equilibrium

$$\frac{A}{B} = \frac{8\pi h\nu^3}{c^3} \tag{2.73}$$

The ratio $p/p_s$, where p is the probability of spontaneous emission
and $p_s$ is the probability of stimulated emission, is given by
Equations 2.67 and 2.68 as

$$\frac{p}{p_s} = \frac{dN_e/dt}{(dN_e/dt)_s} = \frac{A}{B\,U(\nu)} \tag{2.74}$$

because B = C. For thermal equilibrium where no radiation is present
except that due to the temperature of the gas or solid material
itself, $U(\nu)$ becomes $U_T(\nu)$, so Equations 2.66, 2.73 and 2.74 can
be combined to give

$$\frac{p}{p_s} = \frac{A}{B\,U_T(\nu)} = \frac{8\pi h\nu^3}{c^3} \times \frac{c^3[\exp(h\nu/kT) - 1]}{8\pi h\nu^3}$$

Therefore

$$p = [\exp(h\nu/kT) - 1]\,p_s$$

At a temperature T of 300°K (27°C), $kT = 300/11{,}600 = 0.026$
eV. For radiation in the visible spectrum at a frequency of, say,
$6 \times 10^{14}$ Hz ($\lambda$ = 5,000 Å),
$h\nu = 4.15 \times 10^{-15} \times 6 \times 10^{14} = 2.49$ eV.
Hence

$$\frac{h\nu}{kT} = \frac{2.49}{0.026} \approx 96$$

so

$$p = [\exp(96) - 1]\,p_s \approx 10^{41.6}\,p_s$$

Thus stimulated emission is exceedingly unlikely, except at high
temperatures. Stimulated emission of any significance at optical
frequencies consequently demands the use of external radiation so
that Equation 2.74 applies. Even so, the radiation of energy density
$U(\nu)$ has to be substantially monochromatic. To arrange that such
stimulated emission results in a larger number of photons emitted than
are absorbed (i.e. light amplification), the population $N_e$ of the
excited state must be made to exceed the population $N_0$ of the ground
state (or at least of a lower energy state), a process known as
population inversion. Population inversion leading to light
amplification is achieved by a technique known as optical pumping,
utilised in the optical maser or laser (Chapter 9).

On the other hand, in the microwave region at a wavelength of,
say, 1 mm corresponding to a frequency of $3 \times 10^{11}$ Hz, the
energy $h\nu$ of the photon is only
$4.15 \times 10^{-15} \times 3 \times 10^{11} = 1.245 \times 10^{-3}$ eV.
Now at 300°K,

$$\frac{h\nu}{kT} = \frac{1.245 \times 10^{-3}}{2.6 \times 10^{-2}} = 0.048$$

so

$$\frac{p}{p_s} = \exp(0.048) - 1 = 0.05$$

and stimulated emission predominates.
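
The optical and microwave cases can be compared side by side with the
relation $p/p_s = \exp(h\nu/kT) - 1$ for thermal equilibrium. This is a
minimal sketch using the rounded constants of the text; the function name
is an assumption made for illustration.

```python
import math

H_EV_S = 4.15e-15           # Planck's constant in eV s (value from the text)
K_EV_PER_K = 1.0 / 11600.0  # Boltzmann's constant in eV per kelvin

def spontaneous_to_stimulated(frequency_hz, temperature_k):
    """Ratio p/p_s = exp(h*nu/kT) - 1 for a material in thermal
    equilibrium with no external radiation incident upon it."""
    x = H_EV_S * frequency_hz / (K_EV_PER_K * temperature_k)
    return math.expm1(x)  # expm1 keeps precision when x is small

optical = spontaneous_to_stimulated(6e14, 300.0)    # 5,000 A light
microwave = spontaneous_to_stimulated(3e11, 300.0)  # 1 mm microwaves
print(f"optical:   p/p_s = 10^{math.log10(optical):.1f}")
print(f"microwave: p/p_s = {microwave:.3f}")
```

Because kT is not rounded to 0.026 eV here, the optical exponent differs
slightly from the figure in the text, but the conclusion is unchanged:
spontaneous emission dominates overwhelmingly at optical frequencies and
stimulated emission dominates in the microwave region.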



In the microwave region, again a method of obtaining population 
inversion is required to lead to amplification; it is practised in the 
maser, an essentially electronic device not to be described further in 
this text. 



Exercise 2 

1. Write an essay on the introduction of quantum theory into theoretical phys- 
ics, dealing particularly with the main concepts needed to explain the dis- 
tribution of energy in the spectrum from a heated black-body, the emission of 
electrons from a conducting surface exposed to radiation, and the Bohr 
theory of the spectrum of the hydrogen atom. 

2. The work function energy of molybdenum is 4.15 eV. Calculate: (a) the
threshold wavelength of the radiation which is just able to release electrons
from this metal; and (b) the maximum speed of the electrons emitted when
the molybdenum is exposed to ultra-violet radiation of wavelength 2,000 Å.

3. Given that the wavelengths $\lambda$ of the lines in the spectrum of the
hydrogen atom may be calculated from the equation

$$\frac{1}{\lambda} = \frac{e^4 m_e}{8\varepsilon_0^2 c h^3}\left(\frac{1}{m^2} - \frac{1}{n^2}\right)$$

where e and $m_e$ are the charge and mass respectively of the electron,
$\varepsilon_0$ is the permittivity of free space, c is the velocity of
light in free space, and h is Planck's constant, calculate: (a) the
wavelengths of the radiation emitted in the visible spectrum for
transitions from the energy level for which m = 2 to levels for which
n = 3, 4 and 5; and (b) the ionisation potential of atomic hydrogen.

4. Calculate from first principles the radius of the Bohr orbit for the hydrogen
atom in the ground state for which the principal quantum number n = 1.

5. Write down the Schrödinger equation for the motion of a particle in one
dimension under the influence of a force which may be derived from a
potential energy V(x), explaining the physical significance of the various
terms. A spherically symmetrical state in three dimensions can be described
by a wave function $r^{-1}u(r)$ where $u(r)$ obeys an equation exactly
similar to the one-dimensional Schrödinger equation. Hence or otherwise,
derive formulae and numerical values for the ionisation potential and Bohr
radius of the ground state of the hydrogen atom. (M. P.)

6. Write an account of the Zeeman effect, in which a distinction is made
between the normal and anomalous effects, and an experimental set-up is
described suitable for determining by means of the Zeeman effect the specific
charge $e/m_e$ for the electron.

7. Account for the simple Zeeman effect which is observed with certain spec- 
trum lines when the source is maintained in a magnetic field. 

A source, normally emitting a singlet line, is subjected to a magnetic field
of 12 kilogauss (1.2 weber m$^{-2}$) and is observed from a direction
perpendicular to the lines of force through a sheet of Polaroid and a
Fabry-Perot etalon of separation 1 cm. The Polaroid is rotated until the
interference pattern consists of two overlapping ring systems. If the
difference in order of interference between the two systems is 2.24 at the
centre of the pattern, calculate the ratio of the charge of the electron to
its mass and indicate the orientation of the transmission direction of the
Polaroid. (Take $c = 3 \times 10^{10}$ cm sec$^{-1}$.) [Note: see also
Section 4.2.] (L.P.)

8. The $D_2$ line in the spectrum of the sodium atom is split into six
components when the sodium source is placed in a magnetic field of flux
density 3 Wb m$^{-2}$. The separations (in wave numbers) of these lines from
the original $D_2$ line in zero magnetic field are ±0.47 cm$^{-1}$,
±1.41 cm$^{-1}$ and ±2.35 cm$^{-1}$. With reference to the appropriate
energy-level diagram, give an outline explanation of this phenomenon.

9. Give a simple explanation of the Zeeman effect and draw diagrams to show 
the polarisations to be expected in the various components. How may the 
observed effect differ from your simple theory? 

Use your theory of the Zeeman effect to calculate the separations in the
pattern to be expected for a singlet line of wavelength 5000.0 Å in a field of
185 kilogauss (or 18.5 weber m$^{-2}$). (Take the electronic charge to be
$1.60 \times 10^{-20}$ e.m.u. $= 1.60 \times 10^{-19}$ coulomb, mass of
electron $= 9.1 \times 10^{-28}$ gm, $c = 3 \times 10^{10}$ cm sec$^{-1}$.)
(L.P.)

10. Write an essay on the band spectra of molecules. 

11. Explain the meaning of the terms spontaneous emission of radiation, and 
stimulated emission of radiation. 

Give a theoretical argument which establishes that stimulated emission of 
radiation at optical frequencies is not normally possible for a system in ther- 
mal equilibrium with its surroundings. 



CHAPTER 3 



Polarisation of Light 



In electromagnetic theory it is shown that visible light consists of 
trains of electromagnetic waves having wavelengths between 
$4 \times 10^{-5}$ cm and $7 \times 10^{-5}$ cm approximately. When travelling
through an isotropic medium, the electric and magnetic field varia- 
tions in this radiation are both in directions perpendicular to the 
direction of propagation. As the electric field vector is generally 
perpendicular to that of the magnetic field and the magnitudes of 
the two are related in a manner depending upon the medium, it 
is usually only necessary to specify one of them. The one chosen is 
the electric field vector. At a particular instant of time at a specified 
point, this electric field vector in a beam of light will be in a certain 
direction. 

If the light is unpolarised, over a finite period of time this direction 
of the electric field will vary from one instant to the next to occupy 
all positions around the direction of propagation, where the likeli- 
hood of any one perpendicular direction being occupied is equal 
to that of any other. 

If by some means the electric field is confined to one perpendicular 
direction about the propagation direction, the light is said to be 
linearly polarised in a given plane. 

An example of polarised waves is provided by long-wavelength 
radio waves travelling over the Earth's surface. At a distance of 
a few wavelengths from the transmitter aerial, only the vertical 
component of the electric field vector can exist; any horizontal 
component will have been eliminated because it will set up, 
in the conducting earth, eddy currents which will dissipate quickly 
the energy in the horizontal field. Such radio waves, having no 
horizontal component of the electric field, are said to be linearly 
polarised in the vertical plane. 



3.1 Polarisation by Selective Absorption 





Polarisation by selective absorption occurs in some crystalline
substances because they are not isotropic: for example, the
permittivity and thermal expansion coefficients are not constants but
depend on direction relative to the crystal axes. Crystals of quinine
iodosulphate absorb energy from electric field variations in a
particular direction. Microcrystals of this material within a cellulose
film have their normally random orientations changed to one direction
in the manufacturing process, which involves stretching the film.
The resulting product, Polaroid, is a greenish material produced
in large thin sheets costing about two shillings per square inch. The
large area obtainable is convenient, and the angle of incidence of
the light is relatively unimportant, but minor disadvantages are
coloration and the transmission of not quite 100% polarised light
(Fig. 3.1).

Fig. 3.1. Polarisation of light by sheets of Polaroid (sheet 1 is oriented so as to
transmit vertical electric field vibrations and absorb horizontal ones; sheet 2
transmits horizontal electric field vibrations and absorbs vertical ones)



3.2 Polarisation by Reflection 

The phenomenon of polarisation by reflection was discovered by 
Malus in 1808. It may be demonstrated experimentally by observing 
light from an extended source both directly and after reflection 
from, for example, a glass sheet, or the surface of water or even of 
polished wood (Fig. 3.2). The reflected light will appear to be
appreciably plane-polarised. Rotation of the Polaroid analyser will
cause variation in the light intensity seen by the observer.

As shown theoretically in Section 1.8, there is an optimum angle
of incidence, the Brewster angle $I_B$, at which the light reflected from
the polished surface of a transparent dielectric is plane-polarised
to the maximum extent. This occurs when the reflected and refracted
rays are at 90° to one another. Referring to Fig. 3.3, where n is
the refractive index of the dielectric relative to air as unity and R
is the angle of refraction,

$$n = \frac{\sin I_B}{\sin R}$$

and

$$R = 90° - I_B$$

Therefore

$$n = \frac{\sin I_B}{\sin(90° - I_B)} = \tan I_B$$

Fig. 3.2. Demonstration of the polarisation of light by reflection

Fig. 3.3. The Brewster angle of incidence


The reflected light is seldom completely plane-polarised, because 
scattering of light owing to the presence of dust on the surface 
forbids perfection. 
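
The relation $n = \tan I_B$ is easily evaluated. The sketch below assumes
a refractive index of 1.5, a typical figure for glass that is not taken
from the text, and confirms that the reflected and refracted rays are
then at 90° to one another; the function names are illustrative.

```python
import math

def brewster_angle_deg(n):
    """Brewster angle of incidence I_B (degrees) from n = tan(I_B)."""
    return math.degrees(math.atan(n))

def refraction_angle_deg(n, incidence_deg):
    """Angle of refraction R (degrees) from Snell's law, air taken as unity."""
    return math.degrees(math.asin(math.sin(math.radians(incidence_deg)) / n))

n_glass = 1.5  # assumed illustrative value, not quoted in the text
i_b = brewster_angle_deg(n_glass)
r = refraction_angle_deg(n_glass, i_b)
print(f"I_B = {i_b:.2f} deg, R = {r:.2f} deg, I_B + R = {i_b + r:.2f} deg")
```

Since the angle of reflection equals $I_B$, the sum $I_B + R = 90°$ is
exactly the condition that the reflected and refracted rays are
perpendicular.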

At the Brewster angle of incidence, the direction of the linearly 
polarised electric field vibration in the reflected light is perpendicu- 
lar to the plane of incidence (Section 1.8). Electric field variations 
parallel to the plane of incidence are eliminated from the reflected 
beam, but they appear in the refracted and transmitted beam. 
The transmitted light is never completely plane-polarised, because 
some of the incident vibrations perpendicular to the plane of inci- 
dence are transmitted. The unwanted component in the transmitted 
light can be reduced by passing the light through another plate, 
parallel to the first, or, better, through a pile of parallel plates. 




Fig. 3.4. Production of plane-polarised light on reflection at the Brewster angle 

The electric vector $E_1$ in the incident light (ray 1) along AO in
Fig. 3.4 is resolved into two components: $E_{1l}$ in the plane of
incidence, and $E_{1r}$ perpendicular to this plane (see Section 1.7). The
reflected and refracted rays (rays 2 and 3 respectively) may be
considered as arising from a point source O in the dielectric medium
just below the surface. When the reflected and refracted rays are
separated by an angle of 90°, it is seen from Fig. 3.4 that the
component $E_{3l}$ in the plane of incidence is only possible in the
refracted beam, and that such a component cannot exist in the reflected
beam because $E_{3l}$ is parallel to OB and an electric vector in the
direction of propagation is untenable.

The use of reflecting glass surfaces as polariser and analyser
is shown in Fig. 3.5(a) and Fig. 3.5(b), and the use of the more
effective 'pile of plates' for these purposes is shown in Fig. 3.5(c).
The pile of plates method is a cheap way of obtaining plane-polarised
light, but it is cumbersome and seldom produces 100%
plane-polarised light.

Fig. 3.5. Use of plates of glass at the Brewster angle of incidence as polariser
and analyser: (a) single plates, plate 1 parallel to plate 2; (b) single plates,
plate 2 rotated through 90° relative to its position in (a); (c) pile of plates (the
intensity of the plane-polarised light is reduced as pile 2 is rotated, and reaches
zero when pile 2 is at 90° to pile 1)



3.3 Polarisation of Light by Scattering 

The production of polarised light by scattering has no advantages 
over other methods in laboratory practice, but it is important as a 
natural phenomenon. The electromagnetic theory involved is 
outlined in Section 1.9. Light from the sky, which is scattered by 
dust particles and gas molecules, and from a rainbow is partially 
polarised, and therefore variations in intensity will be observed on 
viewing it through a Polaroid sheet which is rotated. Although it is 
a simple matter to produce polarised light by scattering in a labo- 
ratory demonstration, it is not possible to make an analyser based 
upon the principle of scattering. 

Light is passed through a tank containing a cloud of fine particles
in the form of smoke or a colloidal suspension in a liquid
(Fig. 3.6). If the incident light is unpolarised, the scattered light
in two directions mutually at right angles is plane-polarised, as
may be observed by the use of a Polaroid sheet. If the incident
light is plane-polarised (by the insertion of a Polaroid sheet at D),
then the transmitted light emerging at E is plane-polarised. As the
plane of polarisation of the incident light is rotated by rotation
of the sheet at D, the intensities at A and B in Fig. 3.6 vary so that
when a maximum occurs at A there is a minimum at B, and vice
versa.

Fig. 3.6. Demonstration of the polarisation of light by scattering

The explanation given in Section 1.9 may be expressed simply 
by stating that an incident transverse wave will excite the scattering 
particles and, in the arrangement of Fig. 3.6, no horizontal vibra- 
tion can be propagated to B and no vertical vibration to A, so the 
two scattered components of the light when viewed at 90° to the 
direction of propagation are plane-polarised. 



3.4 Polarisation of Light by Passage through Wire Grids 

In 1963, Bird and Parrish made a polariser in the form of a set of 
parallel conductors. They used a transmission diffraction grating 
ruled with 20,000 lines to the centimetre; each ruling of the form 
shown in Fig. 3.7(a) was therefore of width approximately equal 
to the wavelength of blue-green light. Gold was then evaporated 
in a vacuum to deposit on the vertical left-hand side edges of the 
grating, so producing 20,000 gold conducting wires per centi- 
metre. The sloping bases of the grooves were free of gold. Such 
gold deposits were made several hundred atoms thick, but much 
less than the wavelength of the light. 
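
The groove width quoted above follows directly from the ruling density. A
minimal sketch of the arithmetic, comparing the result with the
approximate limits of the visible spectrum given at the start of this
chapter (the function name is illustrative):

```python
def groove_spacing_cm(lines_per_cm):
    """Width of one ruling, in centimetres, for a grating with the
    given number of lines per centimetre."""
    return 1.0 / lines_per_cm

# Approximate visible limits quoted at the start of the chapter, in cm.
VISIBLE_MIN_CM, VISIBLE_MAX_CM = 4e-5, 7e-5

spacing = groove_spacing_cm(20000)
print(f"groove spacing = {spacing:.1e} cm = {spacing * 1e8:.0f} angstrom")
print("within visible range:", VISIBLE_MIN_CM <= spacing <= VISIBLE_MAX_CM)
```

The spacing is therefore comparable with a visible wavelength, while the
gold deposit itself, being only several hundred atoms thick, is much
narrower.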

Unpolarised light normally incident on such a wire grid [Fig.
3.7(b)] was shown on transmission to be plane-polarised, with the
electric field variations in the electromagnetic waves confined to
the direction perpendicular to the grid wires. The component
parallel to the wires was absorbed. This is as would be expected,
because any electric field component in the direction of the conductor
would set up alternating potential differences which would
cause alternating currents to flow. These currents would dissipate
rapidly the energy in the parallel field components. Excessive
thickness of the gold deposit was avoided, as otherwise electric field
variations across the thickness would be absorbed.

Fig. 3.7. Polarisation of light by a specially constructed wire grid: (a) transmission
diffraction grating, with gold deposited on all vertical surfaces and all sloping
surfaces clear; (b) polarisation of the light (l signifies component parallel
to wires; r signifies component perpendicular to wires)

It is interesting to note that this experiment shows that the 
simple analogy often quoted of the vibration of a rope in a slot 
is false in discussing the polarisation of light. 

A demonstration experiment similar to that of Bird and Parrish,
with much more easily constructed wire grids having spacings of
the order of millimetres, can be used when the incident radiation
is in the form of microwaves with wavelengths of only a few
centimetres.



3.5 Double Refraction and Polarisation of Light by Crystals 

There are several crystals which exhibit double refraction (also 
known as birefringence): a beam of light incident upon such a
crystal will give rise to two distinct refracted beams within the 
crystal. The two beams of light are plane-polarised in directions 
at 90° to each other. This double refraction was first observed 
by Bartholinus, in 1669, on studying calcite crystals. 

Unpolarised light incident on a calcite crystal will give rise to
two rays within the crystal: the ordinary ray in the same plane
as the incident ray, and the extraordinary ray which is not in the
same plane as the incident ray. The crystal will also have two
different refractive indices: $n_o$ for the ordinary ray and $n_e$ for the
extraordinary ray. Beyond the calcite crystal, two images of a
single source of light are seen. If these images are viewed through
a plate of tourmaline, which is dichroic, rotation of the
tourmaline changes the brightness of the two images and one of
them attains maximum brightness when the other one becomes
invisible.

Calcite (or Iceland Spar), which is crystalline calcium carbon- 
ate, shows marked double refraction. If a calcite crystal is placed 
over a pinhole in a horizontal screen which is illuminated from 
below, and the crystal is oriented so that viewing is along the optic 
axis of the crystal, only one image of the pinhole is seen. When 
the crystal is in any other orientation, two images are visible. 
If the crystal is rotated about a vertical axis, a line through the 
two images will be parallel to the projection of the optic axis on 
the crystal face. 

The separation of the two plane-polarised beams in the calcite
crystal is shown in Fig. 3.8. The calcite crystal is a rhomb, the six
faces being parallelograms of which the angles are 101°55' and
78°5'. At point X [Fig. 3.8(a)] and at the opposite corner, three
obtuse angles each of 101°55' meet. The optic axis of the crystal
is a direction making equal angles with the three faces at X.
Along the optic axis, birefringence does not occur. Calcite and
quartz crystals are uniaxial: they have only one optic axis. A
principal section of the calcite rhomb is a plane which contains the
optic axis and is normal to a face of the rhomb: there are clearly
three principal sections for every point in the rhomb. Fig. 3.8(b)
illustrates a principal section ABCD of the rhomb. The angle at
A is approximately 71° and at B is approximately 109°. A beam of
unpolarised light incident normally on the face AB at P gives rise
to an ordinary ray, which is not deviated, and to an extraordinary
ray, which is deviated (even though incidence is normal) and
thus constitutes a laterally displaced emergent beam. In Fig.
3.8(c) is shown the exit face of the rhomb: the electric vector
vibrations in the ordinary ray are parallel to the longer diagonal
of this section; the vibrations in the extraordinary ray are parallel
to the shorter diagonal.

Fig. 3.8. Double refraction in calcite crystal



3.6 Prism Polarisers 

The Wollaston polariser [Fig. 3.9(a)] is made from two right-angled
quartz prisms A and B with their hypotenuse faces in
contact; this device is particularly useful in the ultra-violet. In
prism A, the direction of the optic axis is perpendicular to the
direction of the normally incident light; in prism B, the optic
axis is perpendicular both to the incident light and to the optic
axis in A. The incident unpolarised light is divided into two
components which diverge in prism B. In prism B, the extraordinary
ray in prism A becomes the ordinary ray, and vice versa. Divergence
occurs in such a way that two plane-polarised beams emerge
in different directions from the exit side of B. These two beams
have equal intensities, and this is a convenience in comparing
images of the two beams, which are side by side with vibrations
perpendicular to one another.

Fig. 3.9. Prism polarisers: (a) Wollaston polariser; (b) Rochon polariser;
(c) Glan-Foucault polariser

The Rochon polariser [Fig. 3.9(b)], also of quartz and used in
the ultra-violet, is similar to the Wollaston arrangement except
that the first prism A now has its optic axis parallel to the normally
incident light. The unpolarised incident beam of light is split into
two plane-polarised components, but both travel with the same
speed in the direction of the optic axis in A. The division of the
two rays takes place at the boundary. The divergence is only about
half that produced by a Wollaston prism of the same size. The
advantage of the Rochon polariser is that the ordinary ray is
transmitted without deviation for all wavelengths. It should always
be used so that the light travels first along the optic axis. If it is
used the other way round, light rays of different wavelengths will
emerge in different directions because rotation will depend on
wavelength.

The Glan-Foucault polariser [Fig. 3.9(c)] is cut from a large
crystal, often of calcite, and consists of two right-angled prisms
with their hypotenuse faces separated by an air-gap. The optic
axes of both prisms are in the same direction, perpendicular to that
of the incident light. The angle $\theta$ is chosen so that the ordinary
ray in the first prism A is totally internally reflected at the
hypotenuse boundary. The thickness of the prism does not matter but
the incident beam divergence must not exceed 7°. With $\theta$ = 38.5°,
$\sin\theta = 0.6225$ and, since $n_o = 1.6584$ at $\lambda$ = 5,893 Å,

$$n_o \sin\theta = 1.032$$

where $n_o$ is the refractive index of calcite for the ordinary ray,
which is totally internally reflected; similarly, since $n_e = 1.4864$ at
$\lambda$ = 5,893 Å,

$$n_e \sin\theta = 0.93$$

where $n_e$ is the refractive index of calcite for the extraordinary ray,
which is transmitted. As compared with the Nicol prism (Section
3.7) the Glan-Foucault arrangement is useful in that it can be used
in the ultra-violet, and it is shorter but has a more limited angular
field. Canada balsam used in the Nicol and other prisms absorbs
ultra-violet radiation.
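
The two products above decide the fate of each ray at the air-gap: a ray
is totally internally reflected when $n \sin\theta$ exceeds unity. The
sketch below uses the indices quoted in the text and assumes the air-gap
has a refractive index of exactly 1; the function names are illustrative.

```python
import math

N_ORDINARY = 1.6584       # calcite, ordinary ray, at 5,893 angstrom
N_EXTRAORDINARY = 1.4864  # calcite, extraordinary ray, at 5,893 angstrom

def is_totally_reflected(n_crystal, incidence_deg, n_gap=1.0):
    """True if n_crystal * sin(theta) exceeds the index of the gap."""
    return n_crystal * math.sin(math.radians(incidence_deg)) > n_gap

def critical_angle_deg(n_crystal, n_gap=1.0):
    """Critical angle of incidence (degrees) at the crystal-gap boundary."""
    return math.degrees(math.asin(n_gap / n_crystal))

theta = 38.5
for name, n in (("ordinary", N_ORDINARY), ("extraordinary", N_EXTRAORDINARY)):
    print(f"{name:13s}: critical angle = {critical_angle_deg(n):.2f} deg, "
          f"totally reflected at {theta} deg: {is_totally_reflected(n, theta)}")
```

The chosen angle of 38.5° lies between the two critical angles, so that
only the ordinary ray is reflected; the narrowness of this interval is
consistent with the limited angular field noted above.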



3.7 The Nicol Prism 

First introduced in 1828 by Nicol, this prism is able to produce a
100% plane-polarised beam of light without coloration. A calcite
crystal is cut across a diagonal, and the two halves cemented
together with Canada balsam or other optical cement. The ordinary
ray is eliminated by total internal reflection at the Canada balsam
layer. A polariser for visible light of excellent quality is thus
available, but the difficulty of finding large crystals of calcite
makes it an expensive device for producing plane-polarised light.

A calcite crystal is chosen of which the ratio of the long edge
to the short edge is between 3.0 and 3.7. The principal section
ABCD of the rhomb has angles of 71° at A and C. The end faces
AB and CD are first ground and polished to reduce these angles
to 68°. The prism is then cut diagonally along BD [Fig. 3.10(a)],
the plane of this cut being normal to the end faces AB and CD
and normal to the principal section [Fig. 3.10(b)]. The cut surfaces
are then optically polished and cemented together again with
Canada balsam.

Fig. 3.10. The Nicol prism

The principal refractive indices at the wavelength of sodium
light ($\lambda$ = 5,893 Å) for calcite are $n_o = 1.66$ for the ordinary
ray and $n_e = 1.49$ for the extraordinary ray; the refractive index n of
Canada balsam is 1.55. A ray XY parallel to the long edges of the
prism incident on the end face AB is divided into two components:
the ordinary ray is totally internally reflected at the balsam layer
provided that the angle of incidence exceeds 69° (which is the
critical angle I, where sin I = 1.55/1.66, so I = 69°); the
extraordinary ray, for which $n_e$ is less than n, is transmitted. The
sides BC and AD of the Nicol prism are blackened so that light in the
ordinary ray is absorbed. The transmitted extraordinary ray is
completely plane-polarised, with its electric field vibrations parallel
to the shorter diagonal of the end face of the rhomb.
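
The critical angle at the balsam layer can be checked with the rounded
indices quoted above. This is a minimal sketch; the function name is an
assumption, and it returns `None` where total internal reflection cannot
occur because the ray passes into a medium of higher refractive index.

```python
import math

N_BALSAM = 1.55         # Canada balsam
N_ORDINARY = 1.66       # calcite, ordinary ray, sodium light
N_EXTRAORDINARY = 1.49  # calcite, extraordinary ray (principal value)

def critical_angle_deg(n_inside, n_outside):
    """Critical angle (degrees) for light passing from n_inside to
    n_outside, or None if n_inside <= n_outside (no total reflection)."""
    if n_inside <= n_outside:
        return None
    return math.degrees(math.asin(n_outside / n_inside))

print("ordinary ray:     ", critical_angle_deg(N_ORDINARY, N_BALSAM))
print("extraordinary ray:", critical_angle_deg(N_EXTRAORDINARY, N_BALSAM))
```

For the ordinary ray the result is close to 69°. With the principal
value of $n_e$ no critical angle exists, which is why the extraordinary
ray is transmitted.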

With the end faces of the Nicol prism ground to the angle of 68°
specified and with Canada balsam as the cement, the angle at the
apex of the cone of rays in air about the central ray XY parallel to
the long edges of the prism can be about 24°. An incident ray will be
at too great an angle to one side of XY if the angle at which the
extraordinary ray meets the balsam surface exceeds the critical
angle (when both the ordinary and extraordinary rays will be reflected
and lost); alternatively, if the incident ray is at too great an angle
to the other side of XY, the angle of incidence of the ordinary ray
will be below the critical angle of incidence at the surface and both
rays will be transmitted. The field of view can be increased by
cutting the Nicol prism so that its end faces are normal to the long
sides of the rhomb. This also gives the advantages that reflection
at the entrance end surface is decreased and the image is not
displaced when the prism is rotated about a longitudinal axis. The
disadvantage is that a larger calcite crystal is required, which is
expensive.

3.8 Huygens' Construction for Determining Refraction in Uniaxial 
Crystals 

Huygens' construction or principle is a geometrical method of 
calculating the shape of a wavefront at any instant of time from 
the known shape of the wavefront at any previous instant of time 
(see Section 7.6 of Volume 1). The principle is that every point 
on a wavefront is a source of secondary wavelets which radiate 
in all directions from their centres with the speed of propagation of 
the wave. The new wavefront is the envelope of the secondary 
wavelets. In a homogeneous isotropic medium, the wavefronts 
from a point source of light in the medium are spherical, but 
many crystals are anisotropic in that they have different optical 
properties in different directions. In uniaxial birefringent crystals,
two wavefronts having different speeds depending on direction are
produced except along the direction of the optic axis, where the
wavefronts have the same speed.

Fig. 3.11. Huygens' construction for representing propagation within a
uniaxial birefringent crystal for incident collimated light

Fig. 3.12. Huygens' construction for propagation in a uniaxial
birefringent crystal with normally incident unpolarised light: (a) optic
axis normal to surface; (b) optic axis parallel to surface; (c) optic
axis at angle to surface

To apply Huygens' construction to light propagation in a uniaxial
crystal, imagine a point source within the crystal. The speed
of propagation of each wavefront emanating from this point source 
depends on the refractive index. The wavefront at any time corre- 
sponding to the ordinary rays is a sphere about the point as centre ; 
the wavefront corresponding to the extraordinary rays is a spheroid 
generated by the revolution of an ellipse about an axis parallel 
to the optic axis. This agrees with the fact that the refractive index 
for the extraordinary ray has an extreme value in the direction of 
the optic axis and a lower extreme value in all directions perpendic- 
ular to the optic axis. 




In Fig. 3.11, AB represents a plane wavefront of monochromatic 
light incident on a plane face AC of a doubly refracting uniaxial 
crystal. At point A, two disturbances originate in the crystal: 
the ordinary one, of which the wavefront is represented by a sphere ; 
the extraordinary one represented by a spheroid. For convenience, 
the optic axis in the direction AP is taken to be in the plane of the 
diagram and so of ABC. In this plane, the ordinary wavefront 
through P has a section POD — it is a circle about A as centre ; the 
extraordinary wavefront is an ellipse, part of which is PEQ. The 
axes of this ellipse are AP and AQ. Whilst light travels in air from 
B to C, light in the ordinary waves in the crystal travels from A to 
O; in the same time, light in the extraordinary waves in the crystal 
travels from A to P at one extreme and to Q at the other extreme. 
The ratios BC/AP and BC/AQ (distance travelled in air to distance
travelled in the crystal in the same time) therefore represent the
extreme values of the refractive index, the former for both rays
(ordinary and extraordinary) in the direction of the optic axis, and the
latter for the extraordinary ray in a direction perpendicular to the
optic axis. The resultant wavefront due to all points between A and C
becomes the tangent plane CO for the ordinary rays and the tan- 
gent plane CE for the extraordinary rays. For the extraordinary 
wavefront, which is elliptical, the ray direction AE is not normal 
to the wavefront. 

Fig. 3.12 shows Huygens' constructions for propagation in a
uniaxial birefringent crystal for normally incident unpolarised light.
In Fig. 3.12(a), the optic axis is normal to the surface; in
Fig. 3.12(b), the optic axis is parallel to the surface; and in
Fig. 3.12(c), the optic axis is at an angle to the surface.

3.9 The Production of Elliptically Polarised Light by Means of a 
Uniaxial Crystal Plate 

A light vibration incident upon a uniaxial birefringent crystal is
divided into two vibrations polarised at 90° to one another. These 
vibrations travel through the crystal with the same speed in the 
direction of the optic axis but with different speeds in directions 
other than the optic axis, the difference being related to the direc- 
tion of incidence relative to the optic axis. 

If the incident beam direction is at 90° to the optic axis, which
is in the surface of the crystal, the maximum difference in speed
will be obtained. Suppose this incident beam is plane-polarised
in any direction and is of monochromatic light of frequency
$\nu = \omega/2\pi$, where $\omega$ is the pulsatance. This incident beam
may be resolved into two vibrations at right angles to one another,
represented at time $t$ by

$$x = a \sin \omega t$$

and

$$y = b \sin \omega t$$

On emerging from the crystal plate, the vibrations are represented
by the two equations

$$x = a \sin (\omega t + \delta_1)$$

and

$$y = b \sin (\omega t + \delta_2)$$

because the two components travel in the crystal with different
speeds so a phase difference is introduced. It is convenient to
put $\delta_2 - \delta_1 = \delta$, the phase difference between one
emergent beam and the other, and write

$$x = a \sin \omega t \tag{3.1}$$

and

$$y = b \sin (\omega t - \delta) \tag{3.2}$$

The manner in which $y$ varies with $x$ can be found by eliminating
$t$. To do this, Equation 3.2 is expanded to give

$$y = b \sin \omega t \cos \delta - b \cos \omega t \sin \delta \tag{3.3}$$

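From Equation 3.1

$$\sin \omega t = \frac{x}{a}$$

and

$$\cos \omega t = \sqrt{1 - \sin^2 \omega t} = \sqrt{1 - \frac{x^2}{a^2}}$$

Substitution in Equation 3.3 gives

$$y = \frac{bx \cos \delta}{a} - b \sqrt{1 - \frac{x^2}{a^2}}\, \sin \delta$$

Therefore

$$\left( y - \frac{bx \cos \delta}{a} \right)^2 = b^2 \left( 1 - \frac{x^2}{a^2} \right) \sin^2 \delta$$

Thus

$$y^2 - \frac{2bxy \cos \delta}{a} + \frac{b^2 x^2 \cos^2 \delta}{a^2} = b^2 \sin^2 \delta - \frac{b^2 x^2}{a^2} \sin^2 \delta$$

or

$$\frac{y^2}{b^2} + \frac{x^2}{a^2} - \frac{2xy \cos \delta}{ab} = \sin^2 \delta \tag{3.4}$$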



Equation 3.4 is, in general, the equation of an ellipse, but special
cases arise.

If $\delta = 0$ or $n\pi$, where $n$ is an integer, Equation 3.4 reduces
to $y = \pm bx/a$ (positive sign for even $n$, negative sign for odd $n$),
which is a straight line representing linearly polarised light in a
plane.

If $\delta = \pi/2$, or an odd integral multiple of $\pi/2$, Equation 3.4
becomes

$$\frac{x^2}{a^2} + \frac{y^2}{b^2} = 1$$

which is the equation of an ellipse and represents a particular case
of elliptically polarised light.

If $\delta = \pi/2$ or an odd integral multiple of $\pi/2$ and $a = b$,

$$x^2 + y^2 = a^2$$

which is the equation of a circle: the light is circularly polarised.
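
As a numerical check on Equation 3.4, the sketch below samples the two
vibrations of Equations 3.1 and 3.2 over one period and confirms that
every point satisfies the ellipse equation. The function names and the
sample values of $a$, $b$ and $\delta$ are illustrative assumptions, not
taken from the text.

```python
import math


def ellipse_residual(a: float, b: float, delta: float, omega_t: float) -> float:
    """Left side minus right side of Equation 3.4 at one instant."""
    x = a * math.sin(omega_t)            # Equation 3.1
    y = b * math.sin(omega_t - delta)    # Equation 3.2
    lhs = y**2 / b**2 + x**2 / a**2 - 2 * x * y * math.cos(delta) / (a * b)
    return lhs - math.sin(delta) ** 2


def max_residual(a: float, b: float, delta: float, samples: int = 360) -> float:
    """Largest absolute residual over one full period of the vibration."""
    return max(
        abs(ellipse_residual(a, b, delta, 2 * math.pi * k / samples))
        for k in range(samples)
    )


if __name__ == "__main__":
    for delta in (0.0, math.pi / 4, math.pi / 2, math.pi):
        print(f"delta = {delta:.3f}  max residual = {max_residual(2.0, 1.0, delta):.2e}")
```

The residual stays at the level of rounding error for any phase
difference, including the straight-line and circular special cases.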



Fig. 3.13. The combination of two sinusoidal variations of the same
frequency acting at right angles to one another and differing in phase
angle by $\delta$ (traces for values of $\delta$ from 0 to $7\pi/4$)



It is seen that elliptically polarised light is produced when a
plane-polarised beam of light passes through a uniaxial birefringent
crystal in such a direction that a phase difference $\delta$ is
introduced between the two emergent light beams.

At a given point in a beam of monochromatic plane-polarised
light, the electric vector is confined to a straight line perpendicular
to the direction of propagation and its magnitude varies sinusoidally
with time. For elliptical polarisation, the electric vector rotates
about the direction of propagation and varies in magnitude with time
in such a way that its end point describes an ellipse.

The form of the ellipse depends on the value of $\delta$. Various values
of $\delta$ in Equation 3.4 give the traces shown in Fig. 3.13. Note that
where $\delta = \pi/2$ (or an odd integral multiple of $\pi/2$) the axes
of the ellipse are aligned with the privileged directions in the crystal.
The angle $\theta$ between the coordinate axes and the axes of the
ellipse is given by

$$\tan 2\theta = \frac{2ab \cos \delta}{a^2 - b^2} \tag{3.5}$$
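
Equation 3.5 can be compared with a direct search for the longest radius
of the traced ellipse. The sketch below is an illustration under stated
assumptions: the function names are hypothetical, and the use of `atan2`
to pick the branch of Equation 3.5 that corresponds to the major axis is
a choice made here, not something stated in the text.

```python
import math


def tilt_from_equation_3_5(a: float, b: float, delta: float) -> float:
    """Angle of the major axis from tan(2*theta) = 2ab cos(delta) / (a^2 - b^2)."""
    return 0.5 * math.atan2(2 * a * b * math.cos(delta), a * a - b * b)


def tilt_by_search(a: float, b: float, delta: float, samples: int = 20000) -> float:
    """Direction of the point on the trace that lies farthest from the origin."""
    best_r2, best_angle = -1.0, 0.0
    for k in range(samples):
        wt = 2 * math.pi * k / samples
        x = a * math.sin(wt)
        y = b * math.sin(wt - delta)
        r2 = x * x + y * y
        if r2 > best_r2:
            best_r2, best_angle = r2, math.atan2(y, x)
    return best_angle


def axis_difference(angle_1: float, angle_2: float) -> float:
    """Smallest difference between two axis directions (axes repeat every pi)."""
    diff = (angle_1 - angle_2) % math.pi
    return min(diff, math.pi - diff)


if __name__ == "__main__":
    a, b, delta = 2.0, 1.0, math.pi / 3
    print(math.degrees(tilt_from_equation_3_5(a, b, delta)))
    print(math.degrees(tilt_by_search(a, b, delta)) % 180.0)
```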



3.10 The Quarter-wave Plate 

A uniaxial birefringent crystal plate of thickness such that the two
emergent waves differ in phase by 90° is termed a quarter-wave
plate, because such a phase difference clearly corresponds to a path
difference of $\lambda/4$, where $\lambda$ is the wavelength of the
incident monochromatic light. A quarter-wave plate need not be extremely
thin as there may be $n$ waves in the direction of transmission in one
privileged direction and $n + \tfrac{1}{4}$ in the other, where $n$ can
be a large integer.

For monochromatic light of wavelength $\lambda$, the minimum thickness
$x$ of a quarter-wave plate made from a crystal for which the
extreme refractive indices are $n_e$ for the extraordinary ray and $n_o$
for the ordinary ray, is given by

$$x(n_e - n_o) = 0.25\lambda \tag{3.6}$$

Quarter-wave plates are usually made of quartz or of mica (sandwiched
between glass plates) cut so that the optic axis is in the plane
of the surface [see Fig. 3.12(b)]. For quartz, $n_e - n_o = 9 \times 10^{-3}$
at $\lambda = 5{,}893$ Å, so

$$x = \frac{5{,}893 \times 10^{-8}}{4 \times 9 \times 10^{-3}} = 1.64 \times 10^{-3}\ \text{cm}$$
Incident plane-polarised light with electric field vibrations in the
direction OZ (Fig. 3.14) has perpendicular components of this field
which vary sinusoidally with time in directions parallel to, and at
right angles to, the crystal optic axis. In the crystal, one component
vibration travels faster than the other; on emergence from the face
ABCD of the crystal, the faster component vibration leads the slower
by a path difference of $\lambda/4$. The emergent light is in
general elliptically polarised, though the special cases of emergent
plane-polarised light occur when $\theta = 0°$ or $90°$ and circularly
polarised light when $\theta = 45°$, where $90° - \theta$ is the angle
between OZ and the optic axis.

Fig. 3.14. Action of the quarter-wave plate (perpendicular vibrations in
phase on entry; phase difference of 90° between perpendicular vibrations
on emergence)

By reversing the direction of the light, the effect of the quarter- 
wave plate in changing elliptically polarised light to plane-polarised 
light is demonstrated. 



3.11 The Half-wave Plate 

The half-wave plate is similar to the quarter-wave plate, except that
it is twice the thickness. Thus the equation for minimum thickness
$x$ is seen by comparison with Equation 3.6 to be

$$x(n_e - n_o) = 0.5\lambda$$

so for quartz at $\lambda = 5{,}893$ Å, the minimum thickness will be
$3.28 \times 10^{-3}$ cm.

A diagram similar to that of Fig. 3.14 could be drawn to repre- 
sent the action of the half-wave plate; the difference between the 
emergent light from a half-wave plate and that from a quarter-wave 
plate is represented by Fig. 3.15. 



Fig. 3.15. Comparison of vibrations of the electric vector in light emerging from
a quarter-wave plate and in that emerging from a half-wave plate: (a) quarter-wave
plate (phase difference of 90° between perpendicular components); (b)
half-wave plate (phase difference of 180°)

The electric vector OZ in the incident plane-polarised light may
be resolved into the perpendicular components Oy and Ox, where
$\angle ZOx = \theta$ [Fig. 3.16(a)]. On traversing a half-wave plate, the
Ox component gains half a wavelength on the Oy component, corresponding
to a phase change of 180°. The direction of Ox is therefore
reversed to become Ox' in Fig. 3.16(b). The resultant electric vector
in the emergent light is now represented by OZ'. The half-wave
plate simply rotates the plane of polarisation through $(180 - 2\theta)°$.
The emergent light is still plane and not elliptically polarised.

Fig. 3.16. Rotation of the plane of polarisation through a specific angle by means
of a half-wave plate: (a) incident light; (b) emergent light
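
The rotation produced by reversing the Ox component can be verified by
treating OZ as a unit vector. This is a minimal sketch; the function
name is a hypothetical helper and the angle is taken in degrees between
0° and 180°.

```python
import math


def half_wave_rotation_deg(theta_deg: float) -> float:
    """Rotation of the plane of polarisation when the Ox component is reversed.

    theta_deg is the angle ZOx of the incident vibration (0 < theta < 180).
    """
    theta = math.radians(theta_deg)
    ox, oy = math.cos(theta), math.sin(theta)  # components of unit vector OZ
    ox_reversed = -ox                          # 180 degree phase change on Ox
    emergent_deg = math.degrees(math.atan2(oy, ox_reversed))
    return emergent_deg - theta_deg


if __name__ == "__main__":
    for theta_deg in (15.0, 30.0, 45.0, 80.0):
        print(theta_deg, half_wave_rotation_deg(theta_deg), 180.0 - 2.0 * theta_deg)
```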

Whereas the quarter-wave plate is useful in the analysis of polar- 
ised light (Section 3.12), the half-wave plate is not; its main function 
is as a half-shade device in a polarimeter (Section 3.22) permitting 
rotations of the plane of polarisation to be determined accurately. 



3.12 The Analysis of Polarised Light 

The four types of light concerned here are unpolarised (abbreviated
to UP), linearly polarised in a plane (PP), circularly polarised (CP)
and elliptically polarised (EP). The total number of different
combinations appears to be fifteen. In fact only seven different types
exist, and these may be identified by a Nicol prism and a quarter-wave
plate. Listing the possible combinations:

Alone (four possibilities): UP, PP, CP, EP.

Two types (six possibilities): UP+PP, UP+CP and UP+EP are
separately identifiable, but PP+CP, PP+EP and CP+EP are
all indistinguishable from EP.

Three types (four possibilities): UP+PP+EP, UP+PP+CP,
UP+CP+EP and PP+CP+EP exist, but the last pair of
components in each of these combinations is indistinguishable
from EP; also, as the first component is either UP (which will
have no effect) or PP (which alters the EP), all these three-type
mixtures are indistinguishable from EP.

Four types (one possibility): UP+PP+CP+EP is the one possi- 
bility, but PP+CP components are indistinguishable from EP 
and UP does not alter the character of EP, so the result is 
indistinguishable from EP. 

The seven distinguishable types of polarised light are thus seen to 
be UP, PP, CP, EP, UP+PP, UP+CP and UP+EP. 
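
The count of fifteen combinations listed above follows from choosing one,
two, three or four of the four basic types. A short sketch using the
standard library confirms the tally; the variable and function names are
illustrative only.

```python
from itertools import combinations

BASIC_TYPES = ("UP", "PP", "CP", "EP")


def mixtures_by_size() -> dict:
    """Map the number of components to the list of possible mixtures."""
    return {
        size: ["+".join(combo) for combo in combinations(BASIC_TYPES, size)]
        for size in range(1, len(BASIC_TYPES) + 1)
    }


if __name__ == "__main__":
    table = mixtures_by_size()
    for size, names in table.items():
        print(size, len(names), names)
    print("total:", sum(len(names) for names in table.values()))
```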

3.12.1 USE OF A NICOL PRISM 

The Nicol prism (a sheet of Polaroid can be used as an alternative) 
transmits light which has its electric vector or a component of this 
vector in the privileged direction of the Nicol prism. Owing to this 
selective transmission, when linearly (plane) polarised light (PP) is 
analysed, there is one orientation of the Nicol prism in which the 
emergent light intensity is zero. 

Let the electric vector in the incident linearly polarised light be
resolved into two 'vibrations'

$$x = a \sin \omega t$$

and

$$y = b \sin \omega t$$

where the x-axis is at an angle $\theta$ to the privileged direction in
the Nicol prism (this direction being parallel to the shorter diagonal
of the end face of the Nicol rhomb, Section 3.7). The resultant
amplitude of the transmitted electric vector in the privileged direction
(Fig. 3.17) is therefore

$$R = a \sin \omega t \cos \theta + b \sin \omega t \sin \theta = \sin \omega t\,(a \cos \theta + b \sin \theta)$$

whilst at 90° to the privileged direction, the amplitude is

$$R' = b \sin \omega t \cos \theta - a \sin \omega t \sin \theta = \sin \omega t\,(b \cos \theta - a \sin \theta)$$

The amplitude $R'$ is zero when

$$\tan \theta = \frac{b}{a}$$

The whole vibration then lies along the privileged direction, so
turning the Nicol prism through 90° from this setting extinguishes the
light. Only with linearly polarised light in a plane (PP) is complete
extinction observed.

With incident unpolarised light (UP), the electric vector can occupy
over a finite period of time any direction perpendicular to the
direction of propagation. Rotation of the Nicol prism will merely
sample components of electric vectors in its privileged direction,
which will be the same regardless of its orientation. Thus unpolarised
light is not identifiable by a Nicol prism.

Fig. 3.17. Resolution of the electric vector in incident plane-polarised light into
x and y components, where the x component is at an angle $\theta$ to the
privileged direction in a Nicol prism

The same effect occurs with incident circularly polarised light
(CP). Representing CP by two 'vibrations'

$$x = a \sin \omega t$$

and

$$y = a \cos \omega t$$

reference to Fig. 3.18 shows that the resultant amplitude of the
transmitted electric vector in the privileged direction of the Nicol
prism is

$$R = a \sin \omega t \cos \theta + a \cos \omega t \sin \theta = a \sin (\omega t + \theta)$$

The amplitude of $R$ does not depend on $\theta$: it is a sinusoidal
variation of the electric vector of the same frequency and amplitude
(assuming there is no absorption of light in the Nicol prism) as the
incident electric vector but differs in phase. Thus in examining UP, CP
or a mixture of UP and CP, no change is observed in intensity on
rotating an analysing Nicol or Polaroid.

Fig. 3.18. Resolution of the electric vectors in incident circularly polarised light
into x and y components, where the x component is at an angle $\theta$ to the
privileged direction in the Nicol prism

If the light incident on a Nicol prism is elliptically polarised
(EP), the two x and y 'vibrations' relevant (Fig. 3.19) are

$$x = a \sin \omega t$$

and

$$y = b \cos \omega t$$

The resultant amplitude $R$ of the electric vector in the transmitted
light in the privileged direction is given by

$$R = a \sin \omega t \cos \theta + b \cos \omega t \sin \theta$$

Fig. 3.19. Resolution of the electric vectors in incident elliptically polarised light
into x and y components, where the x component is at an angle $\theta$ to the
privileged direction in a Nicol prism

Thus

$$R = a \sin \omega t \cos \theta + a \cos \omega t \sin \theta - (a - b) \cos \omega t \sin \theta$$

or

$$R = a \sin (\omega t + \theta) - (a - b) \cos \omega t \sin \theta$$

The term $a \sin (\omega t + \theta)$ has an amplitude independent of
$\theta$, as in CP; the term $(a - b) \cos \omega t \sin \theta$
represents a plane-polarised component which can be
extinguished by rotation of the Nicol prism. Therefore, rotating
a Nicol prism while observing EP will give maxima and minima.
Consequently, EP+UP and PP+UP will be indistinguishable
from EP by the use of a Nicol prism alone.
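
The three cases treated in this section can be compared numerically by
projecting the x and y vibrations on to the privileged direction and
averaging $R^2$ over one period. The sketch below does this under stated
assumptions: the function name, the sample amplitudes and the use of the
time average of $R^2$ as a measure of intensity are illustrative choices.

```python
import math


def transmitted_intensity(x_amp: float, y_amp: float, y_phase: float,
                          theta: float, samples: int = 720) -> float:
    """Time average of R^2 along the privileged direction.

    Incident light: x = x_amp sin(wt), y = y_amp sin(wt + y_phase);
    y_phase = 0 gives PP, y_phase = pi/2 gives EP (CP when the amplitudes match).
    """
    total = 0.0
    for k in range(samples):
        wt = 2 * math.pi * k / samples
        x = x_amp * math.sin(wt)
        y = y_amp * math.sin(wt + y_phase)
        r = x * math.cos(theta) + y * math.sin(theta)
        total += r * r
    return total / samples


if __name__ == "__main__":
    a, b = 2.0, 1.0
    for deg in range(0, 180, 30):
        theta = math.radians(deg)
        pp = transmitted_intensity(a, b, 0.0, theta)
        cp = transmitted_intensity(a, a, math.pi / 2, theta)
        ep = transmitted_intensity(a, b, math.pi / 2, theta)
        print(f"{deg:3d}  PP {pp:.3f}  CP {cp:.3f}  EP {ep:.3f}")
```

Only the PP column falls to zero at some setting; the CP column is
constant, and the EP column varies between a maximum and a non-zero
minimum.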

So far, the use of only a Nicol prism allows only one (PP) of the 
seven types of polarised light to be identified. 

3.12.2 USE OF A NICOL PRISM AND A QUARTER-WAVE PLATE 

If the incident light is passed first through a quarter-wave plate and 
then examined with a Nicol prism, UP, CP, UP+CP, EP, EP+UP 
and PP+UP may be further investigated because the quarter- 
wave plate, by retarding one component relative to the other by 
a quarter period, converts PP to EP, or CP to PP, for example. 

The quarter-wave plate transmits light with the electric vectors 
vibrating in two planes mutually at right angles, one in the direction 
of the optic axis and the other at 90° to the optic axis. Hence the 
effect of the plate will depend on the direction of the electric vectors 
in the incident light. 

The effect of the quarter-wave plate is more complicated than 
that of the Nicol prism because the two components of the incident 
light each affect the intensity of the light transmitted in the two 
privileged directions in the quarter-wave plate. 

Consider plane-polarised light (PP) represented by

$$x = a \sin \omega t$$

and

$$y = b \sin \omega t$$

to be incident on the quarter-wave plate, the axes of which are X and
Y (Fig. 3.20) and where $\phi$ is the angle between the x-direction and
the X-axis. The electric vectors of the incident light resolved in the X
and Y directions are

$$X = a \sin \omega t \cos \phi + b \sin \omega t \sin \phi$$

and

$$Y = b \sin \omega t \cos \phi - a \sin \omega t \sin \phi$$

Let the function of the quarter-wave plate be to advance the Y
component by $\pi/2$. Assuming no loss of light by absorption in the
plate, the electric vectors in the emergent light will be represented by

$$X = a \sin \omega t \cos \phi + b \sin \omega t \sin \phi$$

which is unchanged, and

$$Y = b \cos \omega t \cos \phi - a \cos \omega t \sin \phi$$

in which $\cos \omega t$ replaces $\sin \omega t$ because of the phase
change by $\pi/2$. Therefore

$$\frac{X}{a \cos \phi + b \sin \phi} = \sin \omega t$$

and

$$\frac{Y}{b \cos \phi - a \sin \phi} = \cos \omega t$$

Therefore

$$\frac{X^2}{(a \cos \phi + b \sin \phi)^2} + \frac{Y^2}{(b \cos \phi - a \sin \phi)^2} = 1$$

which is the equation of an ellipse as $\phi$ is a constant for a
particular setting of the quarter-wave plate relative to the incident PP
light. The emergent light is therefore elliptically polarised (EP). Thus
PP light which is incident on a quarter-wave plate is converted, in
general, into EP.

Fig. 3.20. The x and y components of incident plane-polarised light relative to the
privileged X and Y directions of a quarter-wave plate
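
The ellipse obtained for PP light emerging from the quarter-wave plate
can be explored numerically. The sketch below evaluates the emergent
components, checks that they lie on the ellipse, and finds the setting at
which the two semi-axes become equal; that setting satisfies
$\tan \phi = (b - a)/(b + a)$, which follows from equating the two
semi-axes. Function names and sample amplitudes are assumptions made for
illustration.

```python
import math


def emergent_pp(a: float, b: float, phi: float, wt: float) -> tuple:
    """Emergent X and Y for incident PP (x = a sin wt, y = b sin wt)."""
    x_out = (a * math.cos(phi) + b * math.sin(phi)) * math.sin(wt)
    y_out = (b * math.cos(phi) - a * math.sin(phi)) * math.cos(wt)
    return x_out, y_out


def semi_axes(a: float, b: float, phi: float) -> tuple:
    """Semi-axes of the emergent ellipse along the plate's X and Y directions."""
    return (abs(a * math.cos(phi) + b * math.sin(phi)),
            abs(b * math.cos(phi) - a * math.sin(phi)))


def circular_setting(a: float, b: float) -> float:
    """Angle phi at which the semi-axes are equal: tan(phi) = (b - a)/(b + a)."""
    return math.atan((b - a) / (b + a))


if __name__ == "__main__":
    a, b = 2.0, 1.0
    print("semi-axes at phi = 20 deg:", semi_axes(a, b, math.radians(20.0)))
    phi_c = circular_setting(a, b)
    print("circular at phi =", math.degrees(phi_c), semi_axes(a, b, phi_c))
```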




In the analysis of polarised light, this fact is only needed for the 
analysis of mixtures because PP alone is identified by the Nicol prism 
only, without the use of a quarter-wave plate. Viewing light from 
the quarter-wave plate with a Nicol prism will now give a partial 
minimum except in one position producing circularly polarised 
light, when no change in intensity occurs on rotating the Nicol. 

Unpolarised light (UP), represented by electric vectors equal in 
all directions around the direction of propagation, will be unaffec- 
ted by rotation of a quarter-wave plate. 

Consider elliptically polarised light (EP), and, as a special case,
circularly polarised light (CP) incident on a quarter-wave plate.
Let this EP be represented by

$$x = a \sin \omega t$$

and

$$y = b \cos \omega t$$

where for CP, $a = b$. The electric vectors in the incident light in the
X and Y directions are

$$X = a \sin \omega t \cos \phi + b \cos \omega t \sin \phi$$

and

$$Y = b \cos \omega t \cos \phi - a \sin \omega t \sin \phi$$

In the emergent light, X will remain unchanged whereas

$$Y = -b \sin \omega t \cos \phi - a \cos \omega t \sin \phi$$

The similarity with the incident light vectors shows that the
emergent light is also EP.

Thus, a quarter-wave plate converts EP to EP except when
$\phi = 90°$, i.e. when

$$X = b \cos \omega t$$

and

$$Y = -a \cos \omega t$$

or

$$Y = -\frac{aX}{b}$$

which represents PP. Similarly, $\phi = 0°$ when

$$X = a \sin \omega t$$

and

$$Y = -b \sin \omega t$$

i.e. when

$$Y = -\frac{bX}{a}$$

which represents PP. It follows that, with incident EP on a quarter-wave
plate followed by a Nicol prism, rotation of the Nicol will
give complete extinction in a certain direction.

In the special case of CP, $a = b$ so the incident light is represented
in the X and Y directions by $X = a \sin (\omega t + \phi)$ and
$Y = a \cos (\omega t + \phi)$, and the emergent light by
$X = a \sin (\omega t + \phi)$ and $Y = -a \sin (\omega t + \phi)$; this
gives $Y = -X$, which represents linearly polarised light at 45° to the
axes of the quarter-wave plate. Orientation of the quarter-wave plate
has no effect.
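
The conversion of CP and EP by the quarter-wave plate can be checked by
advancing the phase of the Y component by $\pi/2$ directly. This is a
sketch only; the function name is hypothetical and the amplitudes are
sample values.

```python
import math


def through_quarter_wave_plate(a: float, b: float, phi: float, wt: float) -> tuple:
    """Emergent (X, Y) for incident x = a sin wt, y = b cos wt.

    X is left unchanged; every term of Y has its phase advanced by pi/2.
    """
    advanced = wt + math.pi / 2
    x_out = a * math.sin(wt) * math.cos(phi) + b * math.cos(wt) * math.sin(phi)
    y_out = b * math.cos(advanced) * math.cos(phi) - a * math.sin(advanced) * math.sin(phi)
    return x_out, y_out


if __name__ == "__main__":
    # Circularly polarised light (a = b): Y = -X whatever the plate setting.
    for phi_deg in (0.0, 25.0, 70.0):
        x_out, y_out = through_quarter_wave_plate(1.5, 1.5, math.radians(phi_deg), 0.8)
        print(phi_deg, x_out, y_out)
```

With unequal amplitudes the same function reproduces the two
plane-polarised cases: at $\phi = 0°$ the output satisfies $Y = -bX/a$.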

Further consideration of the use of a quarter-wave plate and 
a Nicol prism to analyse polarised light will now be given. 



Incident light is elliptically polarised (EP) 

Referring to Fig. 3.21, with the Nicol prism alone, maxima and
minima will be observed in direction $\theta$. If the quarter-wave plate
is inserted between the incident light and the Nicol prism, it can be
rotated to give a minimum. The quarter-wave plate converts the
incident EP to emergent EP, except when the axes of this plate
correspond to the axes of the ellipse representing the incident light;
when this happens,

$$x = a \sin \omega t$$

and

$$y = b \cos \omega t$$

become either

$$X = b \cos \omega t$$

and

$$Y = -a \cos \omega t$$

or

$$X = a \sin \omega t$$

and

$$Y = -b \sin \omega t$$

which represent PP light. Thus rotation of the Nicol prism will give
zero intensity in the directions at an angle $\phi = \tan^{-1}(a/b)$ or
$\tan^{-1}(b/a)$ from the axes of the ellipse.

Fig. 3.21. Analysis of elliptically polarised light by a quarter-wave plate and a
Nicol prism (X is an arbitrary direction taken to be horizontal for convenience)

With the Nicol prism and the quarter-wave plate, the zero is in 
a different direction from that with the Nicol prism alone. 



Incident light is a mixture of elliptically polarised and unpolarised 
(EP + UP) 

As there is no intensity variation with $\theta$ for UP, with the Nicol
prism alone the presence of UP will merely modify the shape of the
ellipse but not the angle $\theta$ giving the directions of maximum and
minimum intensities.

Insertion of the quarter-wave plate will convert the EP into PP
in four orientations, as described in the section above, but no zero
light intensity is observed because UP is unaffected by the quarter-wave
plate. The direction $\phi$ of the minimum will give the ratio of
the axes of the ellipse where $\phi = \tan^{-1}(a/b)$ or $\tan^{-1}(b/a)$.

The minimum is in a different direction with both Nicol prism 
and quarter-wave plate from that with the Nicol prism alone. 



Incident light is plane-polarised mixed with unpolarised (PP + UP) 

With a Nicol prism alone, a minimum (but not of zero intensity) 
will be observed in a certain direction depending on the PP. 

Insertion of the quarter-wave plate will produce a 'best' minimum.
The angle of setting of the Nicol prism will not change.
Alternatively, the quarter-wave plate will convert the PP to EP.
Thus the emerging light in the X and Y directions of the quarter-wave
plate is represented by

$$X = (b \sin \phi + a \cos \phi) \sin \omega t$$

and

$$Y = (b \cos \phi - a \sin \phi) \cos \omega t$$

In one orientation of the quarter-wave plate at which PP becomes
CP, no variation of light intensity occurs on rotating the
Nicol prism. Then

$$b \cos \phi - a \sin \phi = b \sin \phi + a \cos \phi$$

Therefore

$$\tan \phi = \frac{\sin \phi}{\cos \phi} = \frac{b - a}{b + a}$$

The procedure in the analysis of any of the seven different types
of polarised light is summarised in Table 3.1.

[Table 3.1]



3.13 The Babinet Compensator 

This is constructed from two quartz (or sometimes calcite) prisms
of wedge form mounted in a holder with their hypotenuse faces
adjacent and so that their optic axes are mutually perpendicular
and both are perpendicular to the incident light (Fig. 3.22). The
faces of the wedges are cut parallel to the respective optic axes.
The wedge angles are each $\theta$, where $\theta$ is so small (about
2½°) that the separation of the ordinary and extraordinary rays is
negligible. One wedge is fixed in the holder and the other one can be
moved by a micrometer screw so that its hypotenuse face slides over that
of the adjacent fixed wedge.

Fig. 3.22. The Babinet compensator (the optic axis in wedge 1 lies in the
plane of the diagram; the optic axis in wedge 2 is perpendicular to the
plane of the diagram; wedge 2 is movable in the direction of the arrows)

Quartz is a positive crystal, so light entering will produce an 
ordinary ray which travels faster than the extraordinary ray in the 
wedge 1. On traversing the hypotenuse interface between the 
wedges, the ordinary ray in wedge 1 becomes the extraordinary 
ray in wedge 2, and vice versa, because the optic axis of wedge 2 
is perpendicular to that of wedge 1. The speeds of travel of these 
two rays are consequently exchanged at the interface between the 
two wedges. 

Let the thickness of wedge 1 traversed by the transmitted light
be $d_1$, and let $n_o$ be the refractive index of quartz for the
ordinary ray and $n_e$ the refractive index for the extraordinary ray at
a given wavelength $\lambda$. For quartz, $n_o < n_e$. The optical path
length in wedge 1 is therefore $n_o d_1$ for the ordinary ray, and
$n_e d_1$ for the extraordinary ray. The difference between these
optical lengths is $(n_e - n_o) d_1$.

The light then enters wedge 2 to traverse a thickness $d_2$. On
passing the hypotenuse interface between wedges 1 and 2, that
light which constituted the ordinary ray in wedge 1 becomes the
extraordinary ray in wedge 2, and vice versa. The optical path
difference introduced in wedge 2 is $(n_e - n_o) d_2$, but this has to be
subtracted from that in wedge 1 because of the interchange at the
interface. The overall optical path difference between the two rays
emerging from the compensator is therefore

$$(n_e - n_o) d_1 - (n_e - n_o) d_2 = (n_e - n_o)(d_1 - d_2)$$

As a path difference equal to the wavelength $\lambda$ corresponds to a
phase change of $2\pi$, the phase difference $\phi$ between the two rays
on emergence from the two wedges of the compensator is seen
to be given by

$$\phi = \frac{2\pi}{\lambda} (n_e - n_o)(d_1 - d_2) \tag{3.7}$$

The path lengths $d_1$ and $d_2$ in the wedges can either both be altered
by changing the point of incidence of a narrow beam of light on
the face of wedge 1 or, by moving wedge 2 with the micrometer
screw while wedge 1 is fixed in position, $d_2$ can be altered and $d_1$
kept fixed for a given point of incidence. The phase difference $\phi$ is
zero when $d_1 = d_2$, as is seen from Equation 3.7.
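
Equation 3.7 translates directly into a small function. In the sketch
below the function name is hypothetical, lengths are in centimetres, and
the birefringence is the value for quartz at sodium wavelength quoted
earlier in the chapter; the thicknesses are arbitrary sample values.

```python
import math


def babinet_phase(wavelength: float, birefringence: float,
                  d1: float, d2: float) -> float:
    """Phase difference of Equation 3.7 (radians); all lengths in the same unit."""
    return 2 * math.pi / wavelength * birefringence * (d1 - d2)


SODIUM_WAVELENGTH_CM = 5893e-8
QUARTZ_BIREFRINGENCE = 9e-3   # n_e - n_o

if __name__ == "__main__":
    # Equal thicknesses: the contributions of the two wedges cancel.
    print(babinet_phase(SODIUM_WAVELENGTH_CM, QUARTZ_BIREFRINGENCE, 0.2, 0.2))
    # Thickness difference needed for a phase difference of 2*pi.
    step = SODIUM_WAVELENGTH_CM / QUARTZ_BIREFRINGENCE
    print(step, babinet_phase(SODIUM_WAVELENGTH_CM, QUARTZ_BIREFRINGENCE, 0.2, 0.2 - step))
```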

A disadvantage of the Babinet compensator is that a specified
value of $\phi$ is confined to a narrow region of the compensator
parallel to the refracting edges of the wedge-shaped prisms. This
drawback is overcome in the Soleil compensator which is a modification
of the Babinet compensator (Section 3.16).

3.14 The Babinet Compensator Between Crossed Polarisers 

Suppose the whole surface of the Babinet compensator is illuminated
with collimated plane-polarised monochromatic light obtained
by the use of a Nicol prism or by a sheet of Polaroid placed
between a collimator and the compensator. It is convenient to
adjust the plane of polarisation of this incident light to be at 45°
to the optic axes of the wedges. The ordinary and extraordinary
rays in the compensator will then have equal amplitudes.

Fig. 3.23. The action of the compensator between 'crossed' polarisers (EP denotes
elliptically polarised light). Unpolarised, collimated monochromatic
light passes through the polariser, which gives plane-polarised light at
45° to the optic axes of the wedges, and then through the compensator and
the analyser crossed with the polariser; the linear separation between
neighbouring dark bands in the transmitted intensity corresponds to
$\phi = 2\pi$

The transmitted light is viewed with a Nicol prism or Polaroid 
adjusted to cut out the incident light if the compensator were 
absent. Thus the polariser 1 in the incident light is 'crossed' with 
respect to the polariser (analyser) 2 in the emergent light. 



Wherever in the field of illumination of the compensator the
two wedges are of such thicknesses $d_1$ and $d_2$ as to give $\phi = 0$
or $2\pi$, dark bands will appear in the field observed through the
analyser. In between these dark bands, where $\phi \neq 0$ or $2\pi$, the
light will not be eliminated. At halfway points between the dark
bands, where $\phi = \pi$, the light will be plane-polarised. At all other
positions, the light will be elliptically polarised. These observations
are illustrated by Fig. 3.23.
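
The positions of the dark bands can be reproduced by a short numerical
sketch. It assumes, as in the arrangement described above, incident light
polarised at 45° to the optic axes (so the two components have equal
amplitudes) and an analyser set at right angles to the incident
vibration; the function name and the use of the time average of the
squared amplitude as the intensity are illustrative assumptions.

```python
import math


def crossed_transmission(phi: float, samples: int = 720) -> float:
    """Fraction of the incident intensity passed by the crossed analyser.

    phi is the phase difference introduced between the two components.
    """
    amp = 1.0 / math.sqrt(2.0)      # equal components of a unit vibration at 45 degrees
    passed = incident = 0.0
    for k in range(samples):
        wt = 2 * math.pi * k / samples
        x = amp * math.sin(wt)
        y = amp * math.sin(wt + phi)
        r = (x - y) / math.sqrt(2.0)  # projection on the analyser direction
        passed += r * r
        incident += math.sin(wt) ** 2
    return passed / incident


if __name__ == "__main__":
    for multiple in (0.0, 0.5, 1.0, 1.5, 2.0):
        phi = multiple * math.pi
        print(f"phi = {multiple:.1f} pi  transmission = {crossed_transmission(phi):.3f}")
```

The computed fraction agrees with $\sin^2(\phi/2)$: it vanishes at
$\phi = 0$ and $2\pi$ (the dark bands) and reaches unity at $\phi = \pi$.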

If the incident light is white, the band at $\phi = 0$ will be black
but the others will all be coloured because $\phi$ now depends on the
wavelengths of the constituents of white light. The use of white
light and observation of this black band in the field of view of the
analyser gives a useful means of determining where the phase
difference $\phi$ is zero.

With monochromatic light, the rays as represented in Fig. 3.23
do not move on sliding one wedge over the other (by means of the
micrometer screw), but the positions of the dark bands do move.
Let $s$ be the linear separation between neighbouring dark bands
for a given wavelength $\lambda$. This distance $s$ corresponds to a phase
difference of $2\pi$. If a specific dark band is moved through a distance
$l$ by movement of the micrometer screw which slides one wedge
over the other, the phase difference introduced by this movement
is therefore $(l/s)2\pi$. The micrometer screw rotation can therefore
be calibrated for a given wavelength in terms of the phase difference
it introduces.
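
The calibration rule can be put in a few lines of Python. This is only a sketch: the function name and the sample readings (bands 4.0 mm apart, a traverse of 1.0 mm) are hypothetical and do not come from the text.

```python
import math

def phase_from_traverse(l, s):
    """Phase difference (rad) introduced when a dark band is moved a distance l,
    given the separation s between neighbouring dark bands (same units as l).
    Uses phi = (l / s) * 2 * pi."""
    if s <= 0:
        raise ValueError("band separation s must be positive")
    return 2.0 * math.pi * l / s

# Hypothetical readings: bands 4.0 mm apart, band moved through 1.0 mm.
phi = phase_from_traverse(1.0, 4.0)
print(round(math.degrees(phi), 1))  # a quarter of the band spacing gives 90.0 degrees
```

Because $s$ depends on the wavelength, the same traverse $l$ gives a different phase for a different source, which is why the calibration holds for one wavelength only.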



3.15 Analysis of Elliptically Polarised Light with a Babinet 
Compensator 

The calibration of the compensator is valid only for a particular 
wavelength. If sodium light is used, for example, the compensator 
must be calibrated for that light. 

A source of monochromatic light (a mercury lamp with a filter, 
or a sodium lamp) is used with a collimator and polariser (Nicol 
prism or Polaroid) set to produce plane-polarised light incident 
upon the Babinet compensator, the direction of linear polarisation 
being at 45° to the optic axes of the compensator. A second polar- 
iser — usually a Nicol prism mounted on the compensator — is used 
to analyse the emergent light. This second Nicol will produce the 
dark bands corresponding to $\phi = 2\pi$ when set in a particular
orientation, which is the 'crossed' position relative to the first Nicol 
prism. 

On substituting white light for the monochromatic light, the
'zero' position when $d_1 = d_2$ and $\phi = 0$ (Equation 3.7) is recognisable
as a black band flanked on each side by coloured bands.
The cross-wires of an eyepiece used to observe the light from the
second (analyser) Nicol prism are set on this zero position black
band.

The source of elliptically polarised light (monochromatic of 
the same wavelength as the original plane-polarised light) is now 
substituted for the plane-polarised source. The elliptically polarised 
light comprises two electric vectors at right angles to one another, 
with a phase difference $\delta$ between them. This phase difference is
added to that produced by the Babinet compensator. The zero 
position black band will therefore move in the field of the eyepiece 
from that which it occupied when white light was used. 

If the total phase difference ($\delta$ plus that due to the compensator)
is adjusted by movement of the compensator micrometer to be
zero or $\pi$, the light emerging from the compensator will be plane-
polarised (see Fig. 3.13). However, it will not in general be at 45°
to the optic axes of the quartz wedges of the compensator, so the 
analyser Nicol and the Babinet compensator will both have to be 
rotated to produce the darkest band possible. 

The distance is noted through which the micrometer screw is 
moved to go from the zero position band with white light to the 
dark band (after rotation of the analyser and the compensator) 
with the elliptically polarised light. As shown in Section 3.14, 
this recorded traverse of the calibrated micrometer screw gives the 
phase angle $\delta$ or $\pi - \delta$.

If the privileged direction (i.e. along the shorter diagonal of the 
end face of a Nicol prism) and the setting of the axes of the Babinet 
compensator at which the dark band is observed with incident 
elliptically polarised light (rendered plane-polarised by appropriate 
adjustment of the Babinet compensator) are both recorded, and 
if $\theta$ is the angle between the privileged direction of the Nicol and
the direction of the optic axis of the first compensator wedge, then

$$\tan\theta = \frac{b}{a}$$

where $b$ and $a$ are the axes of the elliptical vibration.

3.16 The Soleil Compensator 

This is a modification of the Babinet compensator with the advan- 
tage that the same phase change occurs over the whole area of 
overlap of the prisms. The two wedge-shaped prisms 1 and 2 of 
quartz have their optic axes in the same direction, both perpendic- 
ular to the incident light, and their hypotenuse faces are adjacent 
so that wedge 1 can be moved by a micrometer screw to vary the
thickness $d_1$ through both prisms (Fig. 3.24). Behind the wedge 2 is
a plane-parallel plate of quartz of thickness $d_2$, the optic axis of
which is perpendicular both to the parallel optic axes of wedges
1 and 2 and also to the incident light. In this way is provided a
compensator of which the thickness can be varied, but this thickness
is the same for all points on its surface at which the light may be
incident.

Fig. 3.24. The Soleil compensator



3.17 Optical Activity: Rotation of the Plane of Polarisation 

In Section 3.9, the behaviour of birefringent crystals in producing 
two plane-polarised beams of light from an unpolarised incident 
beam was stated to be unique in the direction of the optic axis 
of the crystal in that both these beams supposedly travel at the 
same speed in this direction and no double refraction occurs. 
One might expect that light travelling along the optic axis would 
be completely unchanged. In fact, if plane-polarised light is inci- 
dent, the plane of polarisation is rotated on traversing the crystal by 
an amount depending on the material and on the length of the 
optical path in the crystal. 

Many substances exhibit this effect and, strangely, many are 
liquids or solutions where random orientations of the molecules 
might be expected to lead to cancellation of this effect. Substances, 
solid and liquid, which rotate the plane of polarisation are said to
be optically active. They are divided into two groups: dextrorota-
tory and laevorotatory. The former rotate the plane of polarisation
to the right, i.e. clockwise when the observer looks in the direction 
of the oncoming light; the latter cause rotation to the left, i.e. 
anticlockwise. 

Optically active organic liquids usually contain a carbon atom 
in the molecule, which is asymmetrically arranged, or have a 
structure like that of turpentine. The amount of rotation depends 
on the wavelength of the light — a phenomenon known as rotatory 
dispersion — and, in solutions, on the concentration, though the 
solvent may cause small variations. 



3.18 Fresnel's Theory of Natural Rotation 

This optical activity intrinsic to the material, referred to as natural 
rotation, is explained by Fresnel on the assumption that a plane- 
polarised beam of light incident on an optically active medium is 
decomposed into two circularly polarised beams having slightly 
different speeds in the substance. At any point in the medium at a 
linear distance z from the incident face, these two components 
can be combined to form a plane-polarised wave of which the 
plane of polarisation is turned through an angle which increases 
with z. This entails a different refractive index for the two circularly 
polarised components. 

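A circularly polarised wave can be represented by two equations
giving the vibrations of the same amplitude in two privileged
directions. These are

$$x_1 = a \cos \frac{2\pi}{T}\left(t - \frac{z}{v_1}\right)$$

and

$$y_1 = a \sin \frac{2\pi}{T}\left(t - \frac{z}{v_1}\right)$$

representing a right-handed circularly polarised wave travelling
with speed $v_1$ in the $z$-direction, where $T$ is the period, whilst $x_1$
and $y_1$ are the displacements at time $t$.

The other wave (the left-handed circularly polarised one) is
represented by

$$x_2 = -a \cos \frac{2\pi}{T}\left(t - \frac{z}{v_2}\right)$$

and

$$y_2 = a \sin \frac{2\pi}{T}\left(t - \frac{z}{v_2}\right)$$

which has a speed $v_2$.

On emerging from the optically active medium, the two circu-
larly polarised waves are superimposed to form a plane-polarised
wave. The resultant amplitudes $x$ and $y$ are seen to be given by

$$x = x_1 + x_2 = a \cos \frac{2\pi}{T}\left(t - \frac{z}{v_1}\right) - a \cos \frac{2\pi}{T}\left(t - \frac{z}{v_2}\right)
= -2a \sin \frac{2\pi}{T}\left[t - \frac{z}{2}\left(\frac{1}{v_1} + \frac{1}{v_2}\right)\right] \sin \frac{\pi z}{T}\left(\frac{1}{v_2} - \frac{1}{v_1}\right)$$

and

$$y = y_1 + y_2 = a \sin \frac{2\pi}{T}\left(t - \frac{z}{v_1}\right) + a \sin \frac{2\pi}{T}\left(t - \frac{z}{v_2}\right)
= 2a \sin \frac{2\pi}{T}\left[t - \frac{z}{2}\left(\frac{1}{v_1} + \frac{1}{v_2}\right)\right] \cos \frac{\pi z}{T}\left(\frac{1}{v_2} - \frac{1}{v_1}\right)$$

Therefore

$$\frac{y}{x} = -\cot \frac{\pi z}{T}\left(\frac{1}{v_2} - \frac{1}{v_1}\right)$$

As $z$ increases from zero, $y/x$ will vary between $\pm\infty$ and 0
and then back to $\pm\infty$. A complete rotation will occur when

$$-\cot \frac{\pi z}{T}\left(\frac{1}{v_2} - \frac{1}{v_1}\right) = -\cot 2\pi$$

giving the value of $z$ for one complete rotation to be when

$$\frac{\pi z}{T}\left(\frac{1}{v_2} - \frac{1}{v_1}\right) = 2\pi$$

i.e. when

$$z = \frac{2T}{(1/v_2) - (1/v_1)}$$

Hence, in a distance of $z$ cm the plane of polarisation rotates through
$2\pi$ rad, and in a distance of 1 cm it rotates through $2\pi/z$ rad.
Thus, the rotation per centimetre is given by

$$\text{rotation per centimetre} = \frac{\pi}{T}\left(\frac{1}{v_2} - \frac{1}{v_1}\right) \text{ rad} = \frac{180^\circ}{T}\left(\frac{1}{v_2} - \frac{1}{v_1}\right) \tag{3.8}$$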
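
The closed form for $y/x$ can be checked numerically by adding the two circular waves directly. The sketch below is only a check on the algebra: the function names and the values chosen for $T$, $z$, $v_1$ and $v_2$ are arbitrary illustrations, not data from the text.

```python
import math

def resultant_ratio(t, z, v1, v2, period=1.0, a=1.0):
    """y/x obtained by summing the right- and left-handed circular waves."""
    w = 2.0 * math.pi / period
    x = a * math.cos(w * (t - z / v1)) - a * math.cos(w * (t - z / v2))
    y = a * math.sin(w * (t - z / v1)) + a * math.sin(w * (t - z / v2))
    return y / x

def predicted_ratio(z, v1, v2, period=1.0):
    """-cot[(pi z / T)(1/v2 - 1/v1)], the closed form for y/x."""
    u = math.pi * z / period * (1.0 / v2 - 1.0 / v1)
    return -1.0 / math.tan(u)

# Arbitrary illustrative values in consistent units.
for t in (0.1, 0.37, 0.8):
    print(round(resultant_ratio(t, 0.3, 1.0, 0.9), 6),
          round(predicted_ratio(0.3, 1.0, 0.9), 6))
```

The two columns agree at every instant $t$, which is the statement that the resultant is plane-polarised: the direction of vibration depends on $z$ but not on the time.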



If $\lambda$ is the wavelength of the light in free space, $T = \lambda/c$ where
$c$ is the speed of light in free space. Equation 3.8 then becomes

$$\text{rotation per centimetre} = \frac{180^\circ c}{\lambda}\left(\frac{1}{v_2} - \frac{1}{v_1}\right)$$

Now, $c/v_2 = n_2$ and $c/v_1 = n_1$, where $n_2$ and $n_1$ are the refractive
indices of the material for left-hand and right-hand circularly
polarised light respectively. Therefore,

$$\text{rotation per centimetre} = \frac{180^\circ}{\lambda}(n_2 - n_1) \tag{3.9}$$

A 

If the two refractive indices are the same, the rotation of the plane 
of polarisation is zero: the material is not optically active. 

For solids, the rotation produced by a plate of 1 mm thickness
is known as the specific rotation; it is seen to be $(18^\circ/\lambda)(n_2 - n_1)$
from Equation 3.9. Throughout most of the visible spectrum, the
specific rotation $\alpha$ varies with wavelength $\lambda$ according to the equa-
tion

$$\alpha = A + \frac{B}{\lambda^2} \tag{3.10}$$

where $A$ and $B$ are constants for a given material. Thus rotatory
dispersion is present.
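
As a rough numerical illustration of Equation 3.9 applied to a 1 mm plate, the sketch below evaluates $(18^\circ/\lambda)(n_2 - n_1)$ with $\lambda$ in centimetres. The function name is hypothetical; the index differences are those quoted for quartz in Section 3.19 and with Fig. 3.25.

```python
def rotation_deg_per_mm(delta_n, wavelength_cm):
    """Rotation of the plane of polarisation in degrees per millimetre:
    (18 / lambda) * (n2 - n1), with the free-space wavelength in centimetres."""
    return 18.0 * delta_n / wavelength_cm

# Index difference quoted with Fig. 3.25 for quartz at 5,893 angstrom.
print(round(rotation_deg_per_mm(71e-6, 5893e-8), 1))  # 21.7
# Indices quoted in Section 3.19 for quartz at 4,000 angstrom.
print(round(rotation_deg_per_mm(1.55821 - 1.55810, 4000e-8), 1))  # 49.5
```

Both results are of the size expected for quartz and show the rotation increasing towards shorter wavelengths, as Equation 3.10 requires.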



3.19 Experimental Support for Fresnel's Theory of Natural 
Rotation 

As stated in Section 3.18, Fresnel's theory presumes that plane- 
polarised light travelling in the direction of the optic axis in an 
optically active material such as quartz is divided into two circu- 
larly polarised beams travelling with slightly different speeds. The 
refractive index $n_l$ for left-handed quartz at a wavelength of
4,000 Å is 1.55821, and for right-handed quartz it is $n_r = 1.55810$.
Fresnel verified his theory by constructing a combination of three
quartz prisms to form a parallelepiped (Fig. 3.25) which produced
a divergence of the two components so that each could be sepa- 
rately investigated and shown to be circularly polarised by the use 
of a quarter-wave plate and a Nicol prism. 

Thus, in Fig. 3.25, at the face AC the incident light is divided
into two components by the prism ABC made from right-handed
(denoted by R in the diagram) quartz, and the right-handed cir-
cularly polarised beam travels faster than the left-handed one.
This faster beam in ABC becomes the slower beam in the left-
handed (denoted by L) quartz prism CBD, and vice versa, and
again there is an interchange on traversing the boundary BD into
the third right-handed quartz prism BDE. At each refraction, the
separation of the rays is increased.

Fig. 3.25. Fresnel's parallelepiped constructed from three quartz prisms to separate
the incident polarised light into two circularly polarised beams (at $\lambda$ = 5,893 Å:
in R, $n_l = n_r + 71\times10^{-6}$; in L, $n_l = n_r - 71\times10^{-6}$)



3.20 Magnetic Rotation: The Faraday Effect 

In 1845, Faraday discovered that the plane of polarisation of light 
is rotated on passing through transparent isotropic materials in a 
magnetic field. Later, in 1854, Verdet found that the angle of
rotation $\theta$ is proportional to the length $l$ of the path in the material
and the magnetic flux density $B$. Hence

$$\theta = wlB$$

where $w$ is Verdet's constant, which varies with the wavelength of
the light concerned but is almost independent of temperature.

In an experiment to determine Verdet's constant (Fig. 3.26),
a cylinder of the transparent material is placed along the axis of a
long solenoid which produces a magnetic flux density $B$ (about
0.06 Wb m$^{-2}$ or 600 gauss is suitable) known in terms of the measured
current $I$ through the solenoid and its number of turns per unit
length. The incident monochromatic collimated beam of light (for
example, from a sodium lamp with collimator) is plane-polarised
by Polaroid 1, and the emergent light is analysed by Polaroid 2.
Polaroid 2 is first adjusted to be crossed with respect to Polaroid 1
when the magnetic flux is absent. The known magnetic flux den-
sity is then established by switching on the solenoid current, and
the angle through which the plane of polarisation is rotated is
determined by rotation of Polaroid 2 to restore a minimum in
the emergent light. A magnetic flux density of 0.6 Wb m$^{-2}$ will
produce a rotation of the plane of polarisation of about 20°. For
a cylinder of a heavy flint glass, $w = \theta/lB$ will be 0.0647′ cm$^{-1}$ G$^{-1}$.



Fig. 3.26. Experiment to determine Verdet's constant for glass or perspex in the
form of a cylinder. [Diagram: monochromatic collimated light passes through
Polaroid 1, the transparent material inside the solenoid, and Polaroid 2 to the
observer.]

The magnetic flux density $B$ is related to the current $I$ (in am-
peres) and the number of turns $n$ per metre of the solenoid by

$$B = \mu_0 nI$$

where $\mu_0$ is the permeability of free space = $4\pi \times 10^{-7}$ H m$^{-1}$.
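
The two relations $B = \mu_0 nI$ and $\theta = wlB$ can be combined in a short calculation. The following Python sketch assumes that Verdet's constant is expressed in minutes of arc per centimetre per gauss; the solenoid winding, the current and the 30 cm cylinder length are hypothetical values chosen only for illustration.

```python
import math

MU_0 = 4.0e-7 * math.pi  # permeability of free space, H/m

def solenoid_flux_density(turns_per_metre, current_amperes):
    """B = mu_0 * n * I for a long solenoid, in Wb/m^2."""
    return MU_0 * turns_per_metre * current_amperes

def faraday_rotation_degrees(verdet_arcmin_per_cm_gauss, length_cm, b_wb_per_m2):
    """theta = w * l * B, with w assumed to be in minutes of arc per cm per gauss."""
    b_gauss = b_wb_per_m2 * 1.0e4  # 1 Wb/m^2 = 10^4 gauss
    return verdet_arcmin_per_cm_gauss * length_cm * b_gauss / 60.0

# Hypothetical solenoid: 5000 turns per metre carrying 10 A.
B = solenoid_flux_density(5000, 10.0)
print(round(B, 4))  # 0.0628 Wb/m^2, i.e. about 628 gauss
# Hypothetical 30 cm cylinder of heavy flint glass with w = 0.0647.
print(round(faraday_rotation_degrees(0.0647, 30.0, B), 1))  # 20.3 degrees
```

In practice the calculation is run the other way: $\theta$, $l$ and $B$ are measured and $w = \theta/lB$ is deduced.
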
The Faraday effect in solutions is not proportional to the con-
centration, and the magnitude of the effect at a temperature $t$°C
is shown by the value of $w$ for carbon disulphide (CS$_2$) for which

$$w = 0.04347(1 - 1.69\times10^{-3}\,t)$$

A molar magnetic rotation $[M_B]$ may be defined as $M\alpha\rho_1 / M_1\alpha_1\rho$,
where $M_1$, $\alpha_1$ and $\rho_1$ are the molecular weight, the rotation for a
known magnetic flux density, and the density of water respectively;
$M$, $\alpha$ and $\rho$ are these values for another liquid.

In a homologous series, the addition of a CH$_2$ group increases
$[M_B]$ by 1.023, i.e.

$$[M_B] = 1.023N + s$$

where $N$ is the number of CH$_2$ groups and $s$ is a constant for
a given series.



3.21 Rotation of the Plane of Polarisation of Microwaves 

Materials such as sugar solutions, turpentine and quartz do not 
measurably rotate the plane of polarisation of microwaves, i.e. 
radio waves of lengths of the order of a few centimetres. However,
the Faraday effect occurs in the microwave region for certain para-
magnetic salts, ferrites and plasma in electric discharges in gases.

A twisted waveguide rotates the plane of polarisation of the 
microwaves that it transmits. Similarly, the plane of polarisation of 
light can be rotated by passing it through a twisted stack of mica 
plates whose optic axes were in the same direction before twisting. 
A succession of polaroid sheets behaves in the same manner. 

The X-ray investigation of an optically active solid such as quartz 
shows that atoms are arranged in right-handed or left-handed he- 
lices. The oscillations of the electrons in these atoms associated with 
the transmission of an electromagnetic wave have a direction of 
easy constraint which changes uniformly from atom to atom, thus 
rotating the plane of polarisation. 



3.22 Polarimeters and Saccharimetry 

The property of many substances of rotating the plane of polarisa- 
tion of light permits quick analysis and is also valuable in that, if 
a product is dextrorotatory whereas the reactants are laevorotatory, 
then the speed of a chemical reaction and its endpoint may be 
readily determined. Many organic materials exhibit this optical 
activity, but more especially sugar solutions may be quickly identi- 
fied and analysed; hence the name saccharimetry. 

The basis of all saccharimeters is a Nicol prism to convert ordi- 
nary monochromatic unpolarised light into plane-polarised light, 
and a coaxial analysing Nicol so orientated initially that no light is 
transmitted. The optically active material or solution is interposed 
between the two Nicol prisms; this rotates the plane of polarisation 
so that further rotation of the analysing Nicol is needed to restore 
darkness. Usually, cylindrical tubes of two different lengths, of the
material or of glass containing the liquid, are used to avoid confusion
in measurements of the angle of rotation between $\theta$ and $2\pi + \theta$.
Other refinements in the instruments are introduced to make the
measurement of $\theta$ highly accurate. Monochromatic light must be
used because $\theta$ depends on the wavelength; for quartz at 20°C,
this rotation varies from about 48° per millimetre at 4,000 Å
to about 16° per millimetre at 6,800 Å.

The rotation in a liquid depends on the concentration. If $\theta$ is the
rotation measured for a length of path in the solution of $l$ decimetres
and $\rho$ is the density of the solution, then the specific rotation or
specific rotatory power $[\alpha]$ is given by

$$[\alpha] = \frac{\theta}{l\rho}$$

If $c$ is the mass of solute in 100 cm$^3$ of a solution (note: not
100 cm$^3$ of the solvent),

$$[\alpha] = \frac{100\,\theta}{lc}$$



The molar rotation $[M]$ is the product of the molecular weight and
the specific rotation; i.e.

$$[M] = M[\alpha]$$

The rotatory dispersion is given by Biot's law (1817):

$$[\alpha] = A + \frac{B}{\lambda^2}$$

where $A$ and $B$ are constants for a given material and $\lambda$ is the wave-
length.
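
The relation between rotation, tube length and concentration for a solution, $[\alpha] = 100\,\theta/(lc)$ with $l$ in decimetres and $c$ in grams per 100 cm$^3$ of solution, is easily turned into a calculation. The sketch below uses hypothetical function names and a hypothetical reading; it is an illustration of the formula, not a description of any particular instrument.

```python
def specific_rotation(theta_deg, length_dm, c_g_per_100cm3):
    """[alpha] = 100 * theta / (l * c) for a solution."""
    return 100.0 * theta_deg / (length_dm * c_g_per_100cm3)

def concentration(theta_deg, length_dm, alpha_specific):
    """The same relation inverted: c = 100 * theta / (l * [alpha])."""
    return 100.0 * theta_deg / (length_dm * alpha_specific)

# Hypothetical reading: 13.3 degrees in a 2 dm tube holding 10 g per 100 cm^3.
alpha = specific_rotation(13.3, 2.0, 10.0)
print(round(alpha, 1))  # 66.5
# With [alpha] known, a new reading gives the concentration directly.
print(round(concentration(6.65, 2.0, alpha), 2))  # 5.0
```

The second function is the basis of saccharimetry: once $[\alpha]$ is known for the solute and wavelength in use, a measured rotation yields the concentration.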

In a simple polarimeter, the extinction position of the analyser is 
seldom precise; thus the early Mitscherlich polarimeter (1844), 
which uses only a simple polariser and analyser, is not sufficiently 
accurate for most purposes. 

The Lippich polarimeter (Fig. 3.27) is similar to the Mitscherlich 
instrument, but it incorporates a subsidiary Nicol prism N between 
the polariser and the tube of optically active material. This Nicol 
N covers one-half of the field of view. If the polariser and analyser 
are crossed initially, the inclusion of N will cause one-half of the 
field to become light; the intensity of the light seen through the three 
Nicol prisms will depend on the degree of rotation of N relative to 
the analyser. This light intensity can be adjusted to be within the 
range at which the eye is most sensitive. If, now, the analyser is 
rotated, the part of the field which was dark becomes light and the 
part which was light becomes darker, until a position is reached at 
which both halves of the field of view are equally illuminated. The 
optically active material is then introduced, and equal illumination
of the two halves is restored by suitable rotation of the
analyser.



Fig. 3.27. Outline of the Lippich polarimeter (N is replaced by a half-shade
device in the Laurent polarimeter). [Diagram: monochromatic light passes through
the polariser, the subsidiary Nicol N, the tube containing optically active
liquid, and the analyser with its divided circular scale, to the observer.]



Fig. 3.28. The Laurent half-shade plate (showing the cemented junction between
the quartz half-wave plate and the glass)



The condition of equal illuminations of the two halves of a field 
of view can be judged with considerably greater precision than the 
position of zero illumination which would have to be obtained if the 
Nicol prism N were absent. 

In the Laurent polarimeter, the Laurent half-shade plate is used 
in place of the Nicol prism N. This consists of a semicircular plate 
of quartz ABC (Fig. 3.28) cut so that the optic axis is parallel to the 
diameter AOC. This quartz plate is of such a thickness as to act as a
half-wave plate for a given wavelength $\lambda$. It is cemented along the
diametrical edge AOC
to a similar semicircular plate of glass ADC. The thickness of this
glass is such as to transmit the same amount of light as is trans-
mitted by the quartz plate ABC. The quartz plate introduces a phase
change of $\pi$, corresponding to a path difference of $\lambda/2$, between the
ordinary and extraordinary rays for light of a particular wavelength.

Suppose that the direction of linear polarisation of the light trans-
mitted by the polariser is along OP (Fig. 3.28), at an angle $\alpha$ to the
optic axis of the quartz plate. In traversing the quartz, two perpen-
dicular components of amplitudes OR and RP are produced; on
emergence from the quartz, the component of amplitude RP is
changed in phase by $\pi$ with respect to component OR; it therefore
becomes directed along RQ. In traversing the glass, no such phase
change is introduced. Consequently, the direction of linear polaris-
ation of the light emerging from the quartz is along OQ at an angle
$2\alpha$ to that emerging from the glass, which is still along OP.

Rotation of the analyser cannot extinguish the plane-polarised
light from the quartz half at the same time as it extinguishes that
from the glass, because of the angle $2\alpha$ between them. Suitable rota-
tion of the analyser can, however, render both halves of the field
of view equally illuminated.






The effect of slight rotation of the half-shade plate is easily detec- 
ted; furthermore, when its orientation is set conveniently, rotation 
of the plane of polarisation in the instrument brought about by the 
inserted optically active material can be measured with high preci- 
sion. 

An alternative to the Laurent half-shade plate is a biquartz plate. 
This comprises two equal semicircular pieces of quartz joined along 
their mutual diameter, one of which rotates the plane of polarisation 
clockwise and the other anticlockwise. When the analysing Nicol 
is not correctly oriented, the two halves of the field of view appear 
unequally illuminated with monochromatic light or unequally col- 
oured with white light. The plates are cut to such a thickness that 
slight rotation of the analyser turns one half red and the other half 
blue if white light is used. 

Another type of half-shade device is the Cornu-Jellet prism,
introduced by Jellet in 1860 and modified by Cornu in 1870. This is
a Nicol prism from which a wedge of angle about 2½° has been re-
moved and the prism recemented as shown in Fig. 3.29(a). Such
a prism can be used as a polariser or an analyser.



Fig. 3.29. (a) The Cornu-Jellet prism (shaded portion removed and the two halves
recemented); (b) the Lippich triple-field polariser, showing the direction of
vibrations transmitted



The Lippich triple-field polariser uses two subsidiary Nicol prisms
N to provide a more convenient method of dividing the field of view 
into three parts [Fig. 3.29(b)]. The plane of polarisation transmitted 
by each Nicol N is slightly different from that transmitted by the 
polariser Nicol P. When this arrangement is viewed through a tele- 
scope with a short focal length objective, the field of view appears 
divided into three sections. When balanced, the two outer portions 
are made to have the same intensity of illumination as the middle 
portion. In this condition, the incident light must vibrate along 
a direction bisecting the angle between the directions of linear polar- 
isation transmitted by P and N. 



3.22.1 THE SENSITIVITY OF A POLARIMETER 

Suppose BP is the direction of the linearly polarised light transmit-
ted by the polariser P (Fig. 3.30), and BN, at angle $2\theta$ to BP, is the
direction of that transmitted by N (where N is a subsidiary Nicol or
the quartz of a half-shade device). Equal illumination in the two
halves of the field of view will occur when CN = CP, i.e. when the
direction of the vibration transmitted by the analyser is along DBE,
perpendicular to BC. The angle between BC and BP or BN is $\theta$.

The ratio of the intensities in the two sections of the field of view
will be decided by $BF^2/BG^2$, which is unity for equal illumination,
where PF and NG are both drawn perpendicular to DE.

Let the analyser be turned through a small angle $\alpha$, so that the
direction of the vibration which it transmits is along D'BE', to pro-
duce a just perceptible alteration in intensity. The transmitted
components are now BP cos ∠PBE' and BN cos ∠NBD', which
are BP $\sin(\theta - \alpha)$ and BN $\sin(\theta + \alpha)$ respectively. As BP = BN the
ratio of the intensities is $\sin^2(\theta - \alpha)/\sin^2(\theta + \alpha)$.

Fig. 3.30. The sensitivity of a polarimeter

The eye can detect 1% difference in intensity. For a given value
of $\theta$, for this 1% difference,

$$\sin^2(\theta - \alpha) = 0.99\,\sin^2(\theta + \alpha)$$

Therefore

$$\sin(\theta - \alpha) = 0.995\,\sin(\theta + \alpha)$$

Thus

$$\sin\theta\cos\alpha - \cos\theta\sin\alpha = 0.995(\sin\theta\cos\alpha + \cos\theta\sin\alpha)$$

Therefore

$$0.005\,\sin\theta\cos\alpha = 1.995\,\cos\theta\sin\alpha$$

i.e.

$$0.0025\tan\theta = \tan\alpha \approx \alpha$$

because $\alpha$ is a small angle. Suppose $\theta$ = 2°, i.e. 0.035 rad, then the
angle $\alpha$ is given by

$$\alpha = 0.0025 \times 0.035 \approx 0.00009 \text{ rad approx.}$$

Therefore $\alpha \approx 0.3'$, i.e. about 18 seconds of arc.

With the data specified, a change of rotation of the analyser by
about 0.3′ can be detected as a 1% difference in illumination between the
sections of the field of view in the analyser.
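
The sensitivity condition $\sin^2(\theta - \alpha) = 0.99\,\sin^2(\theta + \alpha)$ can also be solved without the small-angle approximation. The Python sketch below does this; the function name and the default 1% threshold argument are illustrative choices, not part of the original treatment.

```python
import math

def detectable_analyser_angle(theta_deg, intensity_fraction=0.01):
    """Analyser rotation alpha (radians) that produces the stated fractional
    intensity difference between the two halves of the field, from
    sin^2(theta - alpha) = (1 - fraction) * sin^2(theta + alpha)."""
    k = math.sqrt(1.0 - intensity_fraction)
    theta = math.radians(theta_deg)
    # Expanding both sines gives tan(alpha) = ((1 - k) / (1 + k)) * tan(theta).
    return math.atan((1.0 - k) / (1.0 + k) * math.tan(theta))

alpha = detectable_analyser_angle(2.0)
print(round(math.degrees(alpha) * 60.0, 2))  # minutes of arc, roughly 0.3
```

Since $\alpha$ is proportional to $\tan\theta$, a smaller half-shade angle $2\theta$ gives a finer setting, at the cost of a dimmer field at balance.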

3.22.2 THE CONCENTRATION OF A SUGAR SOLUTION 

The concentration of a sugar solution is measured with a saccharim- 
eter, which is a polarimeter having a scale calibrated to read sugar 
concentration directly. 

Modern saccharimeters do not depend on the rotation of a Nicol 
analyser to obtain equal illumination. Instead, a compensator intro- 
duced between the crossed Nicol prisms immediately in front of the 
analyser is used to restore equality of illumination in the field of 
view. This compensator (Fig. 3.31) consists of two wedge-shaped 
pieces of quartz, one dextrorotatory and the other laevorotatory. 
These wedges can be moved across the field of view by micrometer 
screws, so that altering the thickness of the quartz introduced coun- 
terbalances the rotation produced by the sugar solution. The sugar 
concentration is read directly on the linear micrometer scale. 



Fig. 3.31. Compensator used as a saccharimeter (dextrorotatory and laevorotatory
quartz wedges moved by a micrometer screw)



3.23 Interference Effects in Uniaxial Crystals 

3.23.1 OPTIC AXIS OF CRYSTAL PARALLEL TO FACE AT WHICH LIGHT 
IS INCIDENT ALONG NORMAL 



A plane-parallel plate of the uniaxial crystal is mounted between 
crossed Nicol prisms [Fig. 3.32(a)]. If the incident light is white, the 
transmitted light is seen to have certain wavelengths missing. The 
values of these wavelengths can be calculated if the thickness of the 
plate and the difference between the two refractive indices at vari- 
ous wavelengths are known. 

The incident beam of plane-polarised light on the crystal plate is 
divided into two components plane-polarised at 90° to each other. 
These components travel through the crystal with different speeds. 
When they combine on emergence from the crystal plate, plane- 
polarised light will be reformed only if the two perpendicular com- 
ponents are in phase, and a complete cut-out of illumination at 
some wavelengths will occur only if both components are of equal 
amplitude. It is therefore usual to arrange for the plane of polarisa- 
tion of the incident light to be at 45° to the optic axis of the crystal 
plate. 

Fig. 3.32. Study of interference effects in a uniaxial crystal between crossed polar-
isers with normally incident collimated white light. [Panel (a): white light passes
through the polariser (first Nicol), the crystal plate with its optic axis parallel
to the surface, and the analyser (second Nicol) to a prism spectrometer; certain
wavelengths are missing from the emergent light.]

The ray of light AB [Fig. 3.32(b)] incident on the crystal plate is
divided into two components having speeds $v_o$ and $v_e$; as the plate is
thin and the incidence is normal, the rays do not diverge appreci-
ably, so C is close to D at emergence. The time of transit of the light
of speed $v_o$ in the plate is $d/v_o$, and for speed $v_e$ it is $d/v_e$, where $d$ is
the plate thickness. The time difference $t$ is therefore

$$t = \frac{d}{v_o} - \frac{d}{v_e} = \frac{d}{c}(n_o - n_e) \tag{3.11}$$



where $c$ is the speed of light in free space, $n_o$ is the refractive index
of the crystal for the light of speed $v_o$ (the ordinary ray), and $n_e$ is
the refractive index for light of speed $v_e$ (the extraordinary ray).

The period of the incident wave motion for a constituent of
wavelength $\lambda$ is $T$ where

$$T = \frac{\lambda}{c}$$

The phase difference $\delta$ corresponding to the time difference $t$ is
hence $2\pi t/T = 2\pi ct/\lambda$ rad. Substituting for $t$ from Equation 3.11,
the phase difference is thus

$$\delta = \frac{2\pi c}{\lambda}\,\frac{d}{c}(n_o - n_e)$$

i.e.

$$\delta = \frac{2\pi d}{\lambda}(n_o - n_e) \tag{3.12}$$

Wherever the phase difference equals $2\pi p$, where $p$ is an integer, the
analyser will cut off the light, so this occurs when

$$p\lambda = d(n_o - n_e) \tag{3.13}$$

In the general case where the analyser is not crossed with respect
to the polariser and the incident plane-polarised light is not at 45°
to the optic axis of the crystal plate, let OX be the direction of the
vibration of the ordinary ray in the crystal and OY that of the extra-
ordinary ray [Fig. 3.32(c)]. The direction of vibration in plane-polar-
ised light from the polariser (first Nicol) is along OP at angle $\alpha$ to
OX, and for the analyser (second Nicol) along OA at angle $\beta$ to OX.

Let the amplitude of the vibration along OP be taken to be unity
for convenience. Assuming zero light absorption in the crystals and
Nicol prisms, the amplitudes of the perpendicular components inci-
dent on the analyser (second Nicol prism) are $\cos\alpha$ along OX and
$\sin\alpha$ along OY. The amplitudes of these vibrations transmitted by
the analyser are therefore $\cos\alpha\cos\beta$ and $\sin\alpha\sin\beta$, with a phase
difference of $\delta$ between them. The intensity $I$ of the resultant light
emerging from the analyser is therefore given by

$$I = \cos^2\alpha\cos^2\beta + \sin^2\alpha\sin^2\beta + 2\cos\alpha\cos\beta\sin\alpha\sin\beta\cos\delta$$

$$= (\cos\alpha\cos\beta + \sin\alpha\sin\beta)^2 - 2\cos\alpha\cos\beta\sin\alpha\sin\beta\,(1 - \cos\delta)$$

Thus

$$I = \cos^2(\alpha - \beta) - \sin 2\alpha \sin 2\beta \sin^2(\delta/2) \tag{3.14}$$

If the Nicol prisms are parallel, $\alpha = \beta$ and (from Equation 3.14)

$$I = 1 - \sin^2 2\alpha \sin^2(\delta/2) \tag{3.15}$$

If the Nicol prisms are crossed, $\alpha = \beta \pm \pi/2$ and, from Equation
3.14,

$$I = \sin^2 2\alpha \sin^2(\delta/2) \tag{3.16}$$

To obtain maximum light intensity with crossed Nicols, Equation
3.16 shows that $\alpha$ must be 45°; i.e.

$$I = \sin^2(\delta/2) \tag{3.17}$$

Then $I$ will be zero when

$$\sin(\delta/2) = 0$$

Thus $I$ will be zero when $\delta = 0, 2\pi, 4\pi, \ldots$


For a plate of thickness $d$, from Equation 3.12,

$$\delta = \frac{2\pi d}{\lambda}(n_o - n_e)$$

Therefore

$$2p\pi = \frac{2\pi d}{\lambda}(n_o - n_e)$$

or

$$\lambda = \frac{d}{p}(n_o - n_e) \tag{3.18}$$

where $p$ is an integer for dark bands; this is the same as Equa-
tion 3.13.
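
The general intensity expression of Equation 3.14 and its special cases can be checked with a few lines of Python. This is only a sketch with a hypothetical function name; angles are in radians and the incident amplitude is taken as unity, as in the text.

```python
import math

def transmitted_intensity(alpha, beta, delta):
    """Equation 3.14:
    I = cos^2(alpha - beta) - sin(2 alpha) sin(2 beta) sin^2(delta / 2)."""
    return (math.cos(alpha - beta) ** 2
            - math.sin(2 * alpha) * math.sin(2 * beta) * math.sin(delta / 2) ** 2)

a = math.radians(45)
# Crossed Nicols with the polariser at 45 degrees to OX (Equation 3.17).
print(round(transmitted_intensity(a, a + math.pi / 2, 2 * math.pi), 6))  # dark band
print(round(transmitted_intensity(a, a + math.pi / 2, math.pi), 6))      # full brightness
# Parallel Nicols give the complementary result (Equation 3.15).
print(round(transmitted_intensity(a, a, math.pi), 6))
```

For any $\delta$ the crossed and parallel intensities at the same $\alpha$ add up to unity, so the wavelengths missing between crossed Nicols are exactly those transmitted most strongly between parallel Nicols.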

As an example, the use of a crystal plate of calcite is now consid-
ered. The following data are available for calcite:

Wavelength $\lambda$ (Å)      4,047     4,861     5,893     6,563
$n_o - n_e$                   0.184     0.177     0.172     0.169


If this calcite plate has a thickness of 0.0025 cm, the wavelengths
missing from the transmitted light when the plate is between crossed
Nicol prisms and the incident plane-polarised light is at 45° to the
optic axis are given by Equation 3.18, where

$$p = \frac{0.0025\,(n_o - n_e)}{\lambda}$$

with $\lambda$ in centimetres. From the data given, the values of $p$ calculated are:

$$\frac{0.0025 \times 0.184}{4.047 \times 10^{-5}}, \quad \frac{0.0025 \times 0.177}{4.861 \times 10^{-5}}, \quad \frac{0.0025 \times 0.172}{5.893 \times 10^{-5}}, \quad \frac{0.0025 \times 0.169}{6.563 \times 10^{-5}}$$

These values of $p$ will not, in general, be integers. To find the inte-
gral values at which component wavelengths will be absent, a graph
of $\lambda$ against $p$ is drawn for the four values of each, and the integral
values of $p$ determined from it; they will be found to lie between
12 and 5.
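
The graphical procedure can be imitated numerically. In the Python sketch below, straight-line interpolation between neighbouring tabulated points stands in for reading the graph, so the wavelengths it returns are rough estimates only; the function names are hypothetical and the data are the four calcite values tabulated above.

```python
# Calcite data from the table: wavelength (angstrom) and n_o - n_e.
DATA = [(4047.0, 0.184), (4861.0, 0.177), (5893.0, 0.172), (6563.0, 0.169)]
THICKNESS_ANGSTROM = 0.0025 * 1.0e8  # plate thickness of 0.0025 cm in angstrom

def order(wavelength, delta_n, d=THICKNESS_ANGSTROM):
    """p = d (n_o - n_e) / lambda, from Equation 3.18."""
    return d * delta_n / wavelength

def missing_wavelengths(data=DATA):
    """Estimate the wavelength for each integral p by linear interpolation of
    lambda against p between neighbouring tabulated points."""
    points = [(order(lam, dn), lam) for lam, dn in data]
    result = {}
    for (p_hi, lam_lo), (p_lo, lam_hi) in zip(points, points[1:]):
        for p in range(int(p_lo) + 1, int(p_hi) + 1):
            frac = (p_hi - p) / (p_hi - p_lo)
            result[p] = lam_lo + frac * (lam_hi - lam_lo)
    return result

for lam, dn in DATA:
    print(lam, round(order(lam, dn), 2))
print({p: round(lam) for p, lam in sorted(missing_wavelengths().items())})
```

Within the tabulated range the integral orders found are $p$ = 7 to 11; extending the graph a little beyond the end points towards the limits of the visible spectrum brings in the neighbouring orders.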




3.23.2 INCIDENT DIVERGENT LIGHT 

Crystal plates viewed in plane-polarised collimated light in the 
direction normal to the optic axis show phenomena resembling 
interference, as explained in Section 3.23.1. When the crystal is cut 
so that the optic axis is in the direction of the chief ray of the incident
plane-polarised white light but this light is not collimated, a
beautiful series of coloured fringes of various forms appears, crossed
by dark crosses and brushes.

Fig. 3.33. Crystal plate between crossed polarisers when the incident light is
divergent: (a) arrangement of source and crystal plate (at P, there is an
extraordinary ray but no ordinary ray); (b) appearance of central region of field of
view beyond the analyser with incident monochromatic light

The crystal plate is between crossed analyser and polariser, where 
the incident plane-polarised light from the polariser diverges in 
a cone from a point source S (Fig. 3.33). The axis of this cone is SO, 
parallel to the optic axis of the crystal. An oblique ray such as SQ 
will produce both ordinary and extraordinary components in the
crystal plate, these components being perpendicular to one another;
the phase difference between these components will depend on the 
thickness of the crystal traversed, and hence on the obliquity of the 
ray. When this phase difference is an integral number of waves, the 
light will be extinguished by the analyser. With monochromatic 
incident light, a series of concentric dark rings will be seen in the 
field of view beyond the analyser. On these rings will be superim- 
posed a dark pattern along the directions OP and OP', where OP is 
parallel to the direction of vibration in the incident plane-polarised 
light and OP' is perpendicular to OP in the plane of the crystal 
face. The dark cross appears because, for any incident ray such as 
SQ, the extraordinary component in the crystal is along the radius 
OQ and the ordinary one is tangential to OQ. Along OP, the light 
travels in the crystal as an extraordinary ray without an ordinary 
component because the component tangential to the radius is 
zero in the incident light. Along OP', only the ordinary ray will 
appear in the crystal. 

For a particular fringe, the phase difference $\delta$ between the two
components depends on the wavelength $\lambda$. If the incident light is
white, colours will appear in the field of view beyond the analyser.
The locus of a fringe of constant phase difference $\delta$ is termed an
isochromatic line, and there will be a corresponding isochromatic 
surface. 

If the incident light is convergent instead of divergent, similar 
arguments prevail in interpreting the fringe patterns observed in the 
field of view. 



3.24 Measurement of Strain by Polarisation Methods 

Some transparent materials, such as Xylonite, produce when 
stressed an optical effect proportional to the stress. The stress 
produced in such a material may be made a correct representation 
of the stress produced in another material not transparent to light 
(for example, a metal). 

The transparent material becomes doubly refracting under 
stress. If, therefore, it is required to study the stress distribution in 
some mechanical member when subject to a load (for example, the 
steel hook attached to a crane), a scale model of this member is 
made in the transparent plastic and subjected to stress by loading. 
Circularly polarised light is transmitted through this model, and 
the emergent light is viewed through a polariser. Wherever the 
path retardation of one component ray (the extraordinary ray,
say) is $\lambda/4$ compared with the other (the ordinary ray), a dark
band will be seen provided that the incident circularly polarised
light is monochromatic of wavelength $\lambda$. The movement of a dark
band may be observed (by eye or photography) as the load is
altered. Areas of high stress are shown by closely spaced dark 
bands. Modifications made to the structure, loading and support, 
simulated in the transparent model, will reveal when the stress 
and the corresponding strain are distributed more uniformly. 



3.24.1 PHOTOELASTIC ANALYSIS 

Photoelastic analysis, which is the further development of the 
polarisation techniques for measuring strain, is one of the most 
effective methods available for solving problems of stress distri- 
bution. It can be used either alone or as a check on mathematical 
calculations. The transparent material of which the model is made 
must be homogeneous. Monochromatic light is preferred as the 
stress lines are sharp enough even in the fifteenth order and consid- 
erable optical magnification is possible. White light produces the 
more picturesque coloured fringes, but these become indistinct above
the sixth order. Path retardations brought about by stress can also 
be measured by the use of such devices as the Babinet and Soleil 
compensators in more sophisticated photoelastic procedures. 



3.25 The Kerr Electro-optical Effect 

Many transparent materials which do not normally exhibit double 
refraction will become doubly refracting and behave like a uni- 
axial crystal when they are subjected to an electrostatic field. This 
is because the field produces induced dipoles in a dielectric and 
increases the dipole moment of any existing polar molecules in the 
dielectric. These dipoles tend to be oriented in the direction of the 
electrostatic lines of force, and this renders the material anisotropic. 
This effect was discovered by Kerr, in 1875, using glass. The
transparent dielectric is placed between the plates of a parallel- 
plate capacitor across which a potential difference is established. 
Light incident on the dielectric in the direction normal to the elec- 
trostatic field lines gives rise to an ordinary and an extraordinary 
ray in the dielectric. If $n_1$ and $n_2$ are the refractive indices for
monochromatic light of which the directions of the electric vector
vibrations are respectively parallel and perpendicular to the field
lines,

$$n_1 - n_2 = k\lambda E^2 \tag{3.19}$$

where $k$ is called the Kerr constant for the material, $\lambda$ is the wavelength
and $E$ the electric field strength.




The Kerr constant k is particularly large for the liquid nitro- 
benzene. A Kerr cell consists of a container of nitrobenzene be- 
tween parallel electrodes and furnished with entrance and exit 
windows. When such a Kerr cell is set coaxially between crossed 
Nicol prisms, no light will be transmitted when the p.d. across the 
cell is zero. When a p.d. is established, light will be transmitted as 
the plane of polarisation of the light from the polariser is rotated 
within the Kerr cell to an extent depending on the electric field 
strength. Though for some solid dielectrics there is a delay of some 
seconds between the application of the field and the maximum 
rotation produced, this delay is only $10^{-11}$ s or less for non-polar
liquids. If, therefore, an alternating p.d. of frequency $f$ is set up
across a Kerr cell containing nitrobenzene, the light output from
the analyser Nicol will be modulated at the same frequency $f$, which
can be as high as 20 MHz. 

Detailed investigation of the light transmitted by the Kerr cell 
when plane-polarised light is incident upon it shows that the 
emergent light is elliptically polarised. Hence in no position of the 
analysing Nicol prism will the transmitted light be of zero intensity. 
The two components of the transmitted elliptically polarised light 
have an optical path difference between them of $x$, given by

$$x = klE^2\lambda \tag{3.20}$$

where $l$ is the optical path length in the cell.

In Equation 3.19, if $E$ is in volts per metre, then at a wavelength
$\lambda$ of 5,893 Å = $5.893 \times 10^{-7}$ m (sodium light), the Kerr constant
$k$ is $5.2 \times 10^{-14}$ for water, $2.5 \times 10^{-12}$ for nitrobenzene and between
$3.2 \times 10^{-16}$ and $1.7 \times 10^{-15}$ for various glasses.
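To give a feel for the magnitudes involved, the sketch below evaluates Equations 3.19 and 3.20 for nitrobenzene in sodium light, taking the path difference as $x = klE^2\lambda$. The cell length of 5 cm and the electrode gap of 1 cm are hypothetical values chosen only for illustration; the Kerr constant and wavelength are those quoted in the text.

```python
import math

K_NITROBENZENE = 2.5e-12   # Kerr constant from the text (E in volts per metre)
SODIUM_WAVELENGTH = 5.893e-7   # metres, from the text

def index_difference(k, wavelength, field):
    """Equation 3.19: n1 - n2 = k * wavelength * E^2."""
    return k * wavelength * field ** 2

def path_difference(k, length, field, wavelength):
    """Equation 3.20: x = k * l * E^2 * wavelength."""
    return k * length * field ** 2 * wavelength

def half_wave_field(k, length):
    """Field at which x equals half a wavelength, i.e. k * l * E^2 = 1/2."""
    return 1.0 / math.sqrt(2.0 * k * length)

CELL_LENGTH = 0.05      # metres (hypothetical cell, not from the text)
ELECTRODE_GAP = 0.01    # metres (hypothetical)

field = half_wave_field(K_NITROBENZENE, CELL_LENGTH)
print("field for half-wave retardation (V/m):", field)
print("corresponding p.d. across the gap (V):", field * ELECTRODE_GAP)
print("n1 - n2 at that field:", index_difference(K_NITROBENZENE, SODIUM_WAVELENGTH, field))
```

Note that the half-wave field does not depend on the wavelength, because $\lambda$ appears on both sides of the condition $x = \lambda/2$.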



3.26 Experimental Verification of Fresnel's Equations for Light 
Reflected at an Air-Glass Interface 



Fresnel's equations (Section 1.7) based on the electromagnetic
theory of light give, for light reflected at the interface between two
different dielectric media, the ratios $A_{2l}/A_{1l}$ and $A_{2r}/A_{1r}$, where
$A_{2l}$ and $A_{1l}$ are the amplitudes in the plane of incidence of the
reflected and incident light beams respectively, whilst $A_{2r}$ and $A_{1r}$
are the amplitudes perpendicular to the plane of incidence of the
reflected and incident light beams respectively. Thus, Equation
1.61 is

$$\frac{A_{2l}}{A_{1l}} = \frac{\tan(I-R)}{\tan(I+R)}$$

where $I$ is the angle of incidence and $R$ is the angle of refraction;
Equation 1.60 is

$$\frac{A_{2r}}{A_{1r}} = -\frac{\sin(I-R)}{\sin(I+R)}$$

From these two equations it follows that

$$\frac{A_{2r}}{A_{2l}} = -\frac{A_{1r}\sin(I-R)}{\sin(I+R)} \times \frac{\tan(I+R)}{A_{1l}\tan(I-R)}$$

Therefore

$$\frac{A_{2r}}{A_{2l}} = -\frac{A_{1r}}{A_{1l}}\,\frac{\cos(I-R)}{\cos(I+R)} \tag{3.21}$$

The ratio $A_{1r}/A_{1l}$ is the tangent of the angle $\alpha$ which the direction
of the electric vector in the incident linearly polarised light makes
with the plane of incidence; similarly $A_{2r}/A_{2l}$ is the tangent of the
angle $\theta$ which the direction of the electric vector in the reflected
light makes with the plane of incidence.

In the experiment a prism is set up on a spectrometer adjusted for 
parallel light. The spectrometer telescope is furnished with an 
analyser Polaroid mounted in a divided circle immediately in 
front of its objective, and a similar polariser Polaroid is mounted 
immediately in front of the collimator lens. 

The procedure is first to determine the refractive index n of the 
glass of the prism by the well-known method of minimum deviation 
(the Polaroids being removed) using sodium light. The Brewster or
polarising angle $I_B$ is given by $\tan^{-1} n$ (Section 3.2). The analyser
Polaroid only is then placed in position before the telescope objec-
tive. The telescope is rotated: (a) to receive light reflected from the
prism face when the incident light is at the angle $I_B$, and (b) to
receive light refracted through the prism. In each instance, the
Polaroid is turned until darkness is obtained in the field of view
of the telescope eyepiece. In case (a), the direction of light vibration
transmitted by the Polaroid is perpendicular to the electric vector
$A_{2l}$. As the plane of incidence in which $A_{2l}$ lies is horizontal in
the experiment, the preferred vibration direction through the crossed 
Polaroid will be vertical. In case (b), the Polaroid vibration direction 
will be horizontal, crossed with respect to the vertical electric vector 
through the prism. 

The prism is then removed and the telescope set in line with the 
collimator. The second Polaroid (the polariser) is mounted in front 
of the collimator lens and rotated to give darkness in the telescope 
eyepiece, the polariser and analyser then being crossed. The reading 
on the divided circle mount of the polariser is noted, and this 
polariser is then turned through 45°. As the analyser Polaroid has
not been rotated meanwhile, it follows that the polariser is trans-
mitting light in which the electric vector is at 45° to the plane of
incidence, so $\tan\alpha = 1$. At such a setting, $A_{1l}$ and $A_{1r}$ are equal,
and Equation 3.21 therefore becomes

$$\frac{A_{2r}}{A_{2l}} = -\frac{\cos(I-R)}{\cos(I+R)} \tag{3.22}$$

where $\tan\theta = A_{2r}/A_{2l}$.

The telescope is now rotated to receive light reflected from the
prism face (Fig. 3.34). As $\tan\theta$ can be determined from the angle
of setting of the analyser Polaroid, $A_{2r}/A_{2l}$ can be found experi-
mentally for various values of the angle of incidence $I$, enabling
Equation 3.22 to be verified, which effectively verifies the Fresnel
equations.
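The values of $\tan\theta$ to be expected in this experiment can be tabulated in advance. The sketch below takes Equation 3.22 in the form $\tan\theta = -\cos(I-R)/\cos(I+R)$ and obtains $R$ from Snell's law; the refractive index 1.5 is a hypothetical value standing in for the measured $n$, and the function names are illustrative.

```python
import math

def refraction_angle(n, incidence):
    """Angle of refraction R from Snell's law, sin I = n sin R (radians)."""
    return math.asin(math.sin(incidence) / n)

def predicted_tan_theta(n, incidence):
    """Equation 3.22 with tan(alpha) = 1: tan(theta) = -cos(I-R)/cos(I+R)."""
    r = refraction_angle(n, incidence)
    return -math.cos(incidence - r) / math.cos(incidence + r)

def predicted_theta_degrees(n, incidence):
    """theta itself, using atan2 so that the Brewster angle (I + R = 90 degrees) is handled."""
    r = refraction_angle(n, incidence)
    return math.degrees(math.atan2(-math.cos(incidence - r), math.cos(incidence + r)))

N_GLASS = 1.5   # hypothetical index; in the experiment n is measured by minimum deviation
print("Brewster angle (degrees):", math.degrees(math.atan(N_GLASS)))
for degrees in (10, 30, 50, 60, 70, 80):
    i = math.radians(degrees)
    print(degrees, predicted_tan_theta(N_GLASS, i), predicted_theta_degrees(N_GLASS, i))
```

The sign of $\tan\theta$ changes on passing through the Brewster angle, where $\cos(I+R) = 0$ and the component in the plane of incidence vanishes.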



Fig. 3.34. Experimental verification of Fresnel's equations (labelled parts:
lens of spectrometer collimator with polariser Polaroid; objective lens of
telescope with analyser Polaroid)



Exercise 3 



1. Describe and explain two methods of producing a beam of plane polarised 
light. Describe in detail how you would proceed in order to determine the 
refractive index of an opaque solid dielectric which has a plane polished 
surface. (L.G.) 

2. Explain how plane polarised light in the visible region may be produced by 
(a) a Nicol prism, (b) a dichroic polariser, for example Polaroid, (c) reflection 
at glass surfaces. Consider carefully the advantages and disadvantages of 
each method. (L.G.) 

3. Describe two methods of producing plane polarised light in the laboratory,
and explain how the state of polarisation of light may be determined experi-
mentally. A beam of plane polarised light is incident on the polished surface
of a piece of black stone. When the angle of incidence is 60° no reflection
takes place for a certain orientation of the beam. Discuss this statement.
Estimate the refractive index of the material of the stone. (L. Anc.)

4. What is meant by (a) the optic axis of a uniaxial crystal, (b) a quarter wave 
plate? Describe and explain how you would distinguish between natural 
light, plane polarised light, and circularly polarised light of a given wave- 
length. (L. Anc.) 

5. Explain what is meant by linearly, circularly and elliptically polarised light 
respectively. Describe how these may be distinguished experimentally. The 
refractive indices of crystalline quartz for a wavelength of 5893 Å are
1.544 for the ordinary ray and 1.553 for the extraordinary ray. Indicate on
a diagram how a quarter wave quartz plate should be cut with respect to the 
optic axis, and calculate the least thickness of the plate. (L.G.) 

6. Explain how the state of polarisation of light from a given source may be 
investigated. 

A parallel-sided plate of thickness 0.050 cm is cut from a crystal so that the
optic axis of the crystal lies on the plane surface of the plate. Plane polarised
light of wavelength 5000 Å is normally incident upon the plate so that the
plane of polarisation makes an angle of 45° with the optic axis. Given that
the ordinary and the extraordinary refractive indices of the material of the
plate are 1.553 and 1.544 respectively, determine the state of polarisation of
the light after transmission through the plate. (L. Anc.) 

7. Write an account of optical rotation (optical activity) including in this 
(a) a discussion of its explanation in terms of oppositely circularly polarised 
light, (b) a comparison of the effects observed in solids and liquids, and 
(c) a comparison of 'natural' optical activity with the rotation caused in glass 
by the presence of a magnetic field. (L.P.) 

8. Describe two methods by which polarised light can be obtained. How does 
unpolarised light differ from circularly polarised light? Discuss the use of 
polarising devices. (M.P.)

9. Explain how you would determine the state of polarisation of a beam of 
monochromatic light. 

In the study of a horizontal beam of elliptically polarised light it is found that 
on inserting a quarter wave plate in a certain orientation, the emergent light 
is linearly polarised, the direction of vibration being vertical. When the 
quarter wave plate is rotated through 90°, the light is linearly polarised with 
the direction of vibration at 60° to the vertical. Determine the ratio of the 
axes of the ellipse and the angle that the major axis makes with the verti- 
cal. (L.P.) 

10. Explain the terms uniaxial and optically active as applied to a transparent 
crystal, indicating the structural features with which these properties are 
associated. 

The plane of vibration of a beam of plane polarised monochromatic light 
is rotated 90° (a) by passing it through an optically active uniaxial crystal 
in a direction parallel to the optic axis, (b) by passing it through a non- 
active uniaxial crystal perpendicular to the axis. In each case the crystal 
has the minimum thickness suitable for the purpose. What is the state of
polarisation of the light half-way through each crystal plate? Explain how
you arrived at your conclusions. (L.P.)

11. Explain the terms 'optical activity' and 'rotary dispersion'. The rotation per 
millimetre for quartz is given by 

e =-2.10x^ 4 

where g is in degrees and A in microns. 

If a plate of quartz is placed between polariser and analyser what is the
minimum thickness of plate which will give a maximum transmission
through the system at 5460 Å?

Calculate the thickness of quartz plate which will cause one quarter of the
previous intensity to be transmitted at the same wavelength. (City)

12. Describe and explain the action of some form of precision polarimeter. 
A thick plate of quartz with its faces cut parallel to the optic axis is placed 
between crossed Nicols with the faces perpendicular to a beam of white light 
which is passed through the system and into a spectrometer. A banded
spectrum is observed having dark bands with centres at wave numbers $k$ 2266,
2152, 2032, 1904, 1767, 1618 mm$^{-1}$. Show that this is consistent with a rota-
tion of the plane of polarisation of the quartz proportional to $A + Bk^2$ where
$A$ and $B$ are constants. (L.G.)

13. A doubly refracting crystal plate with its optic axis parallel to the surface
is placed between two crossed Nicol prisms. The optic axis of the plate makes
an angle $\theta$ with the plane of transmission of the first Nicol prism. The system
is illuminated with a parallel beam of white light, the transmitted light being
analysed by a prism spectrometer. Sketch the optical arrangement. Explain
what is observed, and how this depends on the plate thickness $b$. If $b$ = 0.025
mm, $\theta$ = 45° and the refractive index difference of the plate ($\mu_e - \mu_o$) varies
with $\lambda$ in the manner shown in the table, what wavelengths are absent from
the transmitted light in the range 4000-7000 Å?

$\lambda$ in Å            4047     4861     5893     6563

($\mu_e - \mu_o$)       0.184    0.177    0.172    0.169     (L.G.)

14. Describe some form of polarimeter and its use for the accurate measurement 
of the rotation of the plane of polarisation by an optically active substance. 
Explain the action of a half shadow device in this connection. Discuss pre- 
cautions necessary in the experiment. (L. Anc.) 

15. What is a quarter wave plate and how is it used? 

In a Laurent polarimeter where a half wave plate is used to divide the field, 
the plane of vibration of the incident plane polarised light makes an angle 
of 5° with the optic axis of the half wave plate and the analysing Nicol is 
set in turn in the two positions giving equal brightness in the two parts of 
the field of view. What is the ratio of the intensities observed in these two 
positions? (L.G.) 

16. Describe some form of compensator such as that of Babinet or Soleil, stating 
how it could be used for the analysis of a beam of elliptically polarised light. 
On analysis a horizontal beam of right-handed elliptically polarised light is 
found to have a ratio of major to minor axis of $\sqrt{3}/1$, the former being at an
angle of 45° to the vertical. What angular measurements were made in
obtaining this result? (L.P.)

17. Explain what is meant by optical activity, and give Fresnel's explanation of 
the phenomenon. 

Describe in detail how you would measure to ±0.02° the optical rotation
produced on a beam of plane polarised monochromatic light by a thick 
quartz plate cut perpendicular to the optic axis. (City) 

18. Give Fresnel's theory of the rotation of the plane of polarisation when light 
travels through an optically active medium. In what way does natural opti- 
cal activity differ from that produced by a magnetic field acting parallel to 
the direction of propagation? 

What is the rotation experienced by a beam of plane polarised light of 
wavelength 5000 Å on passing through 10 cm of flint glass cylinder contained
in a solenoid energised to give an axial field of 250 oersted, if the Verdet
constant of the material for this wavelength is 0.032 minute of arc oersted$^{-1}$
cm$^{-1}$? What is the difference in refractive index of the relevant circularly
polarised components of the beam?

(Alternative values of 200 amp-turn cm$^{-1}$ and 0.04 minutes of arc per amp-
turn may be used for the field and Verdet's constant respectively.) (L.P.)

19. 'Optical activity is associated with asymmetry.' Discuss this statement when 
applied to the case of (a) crystalline solids, (b) liquids. Would you expect 
to observe such an effect in a gaseous medium? Describe how the Fresnel
theory accounts for the effect in terms of circularly polarised light. 

In an experiment to determine the specific rotatory power of sugar, a solu- 
tion of a certain concentration was placed in a tube between previously 
crossed nicols, using monochromatic light. Extinction was regained by 
rotating the analyser through 72° in a right-handed direction. In a second 
similar experiment using a solution of one-third the previous concentration 
a left-handed rotation through 36° was required. Deduce what you can 
concerning the actual rotation of the plane of polarisation in each case. 
What measurements should have been made in order to leave no ambigu- 
ities? (L.P.) 

20. Give an account of the theory of reflection and refraction of electromagnetic 
waves at the surface separating air from another medium such as glass, 
assumed to be a perfect dielectric. It will be sufficient to consider the case of 
waves polarised in the plane of incidence, i.e. with the electric intensity vec- 
tor perpendicular to this plane. Establish the formulae for the intensities of 
the reflected and transmitted waves, viz: 

$$\frac{\text{Intensity of reflected wave}}{\text{Intensity of incident wave}} = \left[\frac{\sin(r-i)}{\sin(r+i)}\right]^2$$

$$\frac{\text{Intensity of transmitted wave}}{\text{Intensity of incident wave}} = \left[\frac{2\sin r\cos i}{\sin(r+i)}\right]^2$$



where i and r are the angles of incidence and refraction respectively. 
Explain, with an account of the apparatus used, how you would proceed to 
verify these results experimentally. (L.P.) 



CHAPTER 4 



Multiple-beam Interferometry 



4.1 Multiple-beam Interference in a Plane Parallel-sided Plate 

The interference of light involving division of amplitude in a paral-
lel-sided transparent plate was considered in Chapter 9 of Volume 1.
It was assumed in this account that the surfaces of the plate had low 
reflection coefficients, so only two beams of light took part effec- 
tively in the interference phenomenon. If these surfaces have signif- 
icant reflection coefficients, multiple beams will be involved in the 
interference; moreover, the fringes obtained will be sharper not 
only in the reflected light but also greatly so in the transmitted light. 
This has led, in particular, to the Fabry-Perot interferometer or 
etalon which has become of great value, especially in very high reso- 
lution spectroscopy, and also the Lummer-Gehrcke interferometer 
or plate, which has a similar function though is less frequently en- 
countered in practice. 

Suppose a parallel-sided slab of uniform material of thickness 
d has its parallel surfaces silvered or aluminised to have a reflection 
coefficient r. In general, this material has a refractive index n; in 
important practical cases, the 'slab' is of air for which n = 1, to a 
high degree of accuracy. The air is enclosed between two plates of 
glass or silica by a spacer, and the inner surfaces of these plates are 
silvered or aluminised. 

Let a ray of monochromatic light of wavelength $\lambda$ and
amplitude unity be incident at P on one surface at an angle $\theta$, the
corresponding angle of refraction in the material of
the plate or slab being $R$ [Fig. 4.1(a)]. As multiple reflections occur, a series
of parallel reflected and transmitted beams will arise. The trans- 
mitted beams can be brought to a focus by a suitable lens; the 
reflected beams could be likewise focused. 

Fig. 4.1. Multiple-beam interference in a plane parallel-sided slab (panel (a)
shows the multiple reflected and multiple transmitted beams, with their
amplitudes, at surfaces each of reflection coefficient r)



The reflection coefficients $r$ at each surface are the same. The
transmission coefficient is $t$ into the slab and $t_1$ out of it, where $t$ and
$t_1$ may be, but are not necessarily, equal. As the reflection coefficient
is the intensity of the reflected light from a surface divided by the
intensity of the incident light (and intensity is proportional to the
square of the amplitude), it is easily seen that the amplitudes of the
successive parallel transmitted beams $T_1, T_2, T_3, \ldots$ are $\sqrt{tt_1}$,
$\sqrt{r^2tt_1}$, $\sqrt{r^4tt_1}$, $\ldots$ or $\sqrt{tt_1}$, $r\sqrt{tt_1}$, $r^2\sqrt{tt_1}$, $\ldots$ The ampli-
tudes of successive reflected beams are also easily seen to be $\sqrt{r}$,
$\sqrt{rtt_1}$, $\sqrt{r^3tt_1}$, $\ldots$ The reflected beams are not considered here.

The optical path difference $l_0$ between successive transmitted
beams such as $T_1$ and $T_2$ is seen from Fig. 4.1(b) to be $n(XY + YZ) - XN$,
where $N$ is the foot of the perpendicular from $Z$ on to $XT_1$.
As $XY = YZ = d\sec R$, $XN = XZ\sin\theta = 2d\tan R\sin\theta$ and
$(\sin\theta)/(\sin R) = n$,

$$l_0 = 2nd\sec R - 2nd\sin^2 R\sec R = 2nd\sec R\,(1-\sin^2 R)$$

Therefore

$$l_0 = 2nd\cos R \tag{4.1}$$

The corresponding phase difference is

$$\delta = \frac{2\pi}{\lambda}\,2nd\cos R = \frac{4\pi nd}{\lambda}\cos R \tag{4.2}$$

If the first transmitted beam is taken to have zero phase, since its
amplitude is $\sqrt{tt_1}$, it may be represented by the real part of
$\sqrt{tt_1}\exp j\omega t$, where $\nu = \omega/2\pi$ is the frequency of the monochro-
matic light. The successive beams $T_2, T_3, T_4, \ldots$ will therefore be
represented by the real parts of $r\sqrt{tt_1}\exp j(\omega t - \delta)$, $r^2\sqrt{tt_1}\exp j(\omega t - 2\delta)$,
$r^3\sqrt{tt_1}\exp j(\omega t - 3\delta)$, $\ldots$

When these parallel beams are brought together at a focus by
a suitable lens, the resultant amplitude $A$ at the focus will be decided
by the sum of the complex series

$$A = \mathrm{Re}\left\{\sqrt{tt_1}\,[1 + r\exp(-j\delta) + r^2\exp(-2j\delta) + \ldots]\right\}$$

Clearly, the summation of the infinite series of terms of ratio
$r\exp(-j\delta)$ is involved. Therefore

$$A = \mathrm{Re}\left[\frac{\sqrt{tt_1}}{1 - r\exp(-j\delta)}\right]$$

The intensity is proportional to the square of the amplitude. The
square of the modulus of a complex quantity is found by multiply-
ing by the conjugate, which here is $\sqrt{tt_1}/(1 - r\exp j\delta)$. Therefore
the intensity $I$ is proportional to

$$\frac{tt_1}{[1 - r\exp(-j\delta)](1 - r\exp j\delta)} = \frac{tt_1}{1 - r[\exp j\delta + \exp(-j\delta)] + r^2}$$

Therefore

$$I = \frac{ktt_1}{1 - 2r\cos\delta + r^2} \tag{4.3}$$

or

$$I = \frac{ktt_1}{(1-r)^2} \times \frac{1}{1 + [4r/(1-r)^2]\sin^2(\delta/2)} \tag{4.4}$$

where $k$ is a constant.
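The summation can be checked numerically by adding a large but finite number of beams and comparing the squared modulus with the closed forms. This is a minimal sketch taking $k = 1$ and the intensity as $tt_1/(1 - 2r\cos\delta + r^2)$; the number of terms and the sample values of $r$, $tt_1$ and $\delta$ are arbitrary choices for illustration.

```python
import cmath
import math

def summed_intensity(r, t_t1, delta, terms=2000):
    """Squared modulus of sqrt(t t1) * sum of [r exp(-j delta)]^m, truncated at `terms`."""
    ratio = r * cmath.exp(-1j * delta)
    amplitude = math.sqrt(t_t1) * sum(ratio ** m for m in range(terms))
    return abs(amplitude) ** 2

def intensity_4_3(r, t_t1, delta):
    """Equation 4.3 with k = 1."""
    return t_t1 / (1 - 2 * r * math.cos(delta) + r * r)

def intensity_4_4(r, t_t1, delta):
    """Equation 4.4 with k = 1."""
    peak = t_t1 / (1 - r) ** 2
    return peak / (1 + 4 * r / (1 - r) ** 2 * math.sin(delta / 2) ** 2)

for delta in (0.0, 0.5, math.pi / 2, math.pi):
    print(delta,
          summed_intensity(0.8, 0.04, delta),
          intensity_4_3(0.8, 0.04, delta),
          intensity_4_4(0.8, 0.04, delta))
```

The three columns should agree to rounding error, since the neglected terms of the series are of order $r^{2000}$.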

If $t = t_1$, $tt_1$ may be replaced by $t^2$. Further, if it is assumed that
there is no absorption of light in the reflecting surfaces and in the
material,

$$1 - r^2 = t^2 \tag{4.5}$$



Equation 4.3 for the intensity will be a maximum ($I_{max}$) when
$\cos\delta = 1$, and a minimum ($I_{min}$) when $\cos\delta = -1$. Maxima and
minima will therefore occur at $\delta = 2p\pi$ and $(2p+1)\pi$ respectively,
where $p$ is an integer. Thus

$$I_{max} = \frac{ktt_1}{(1-r)^2} \tag{4.6}$$

and

$$I_{min} = \frac{ktt_1}{(1+r)^2} \tag{4.7}$$

The variation of the intensity $I$ (as a fraction of the maximum
intensity $I_{max}$) with $\delta$ for various values of the reflection coefficient
$r$ is shown in Fig. 4.2. It is here assumed that the maxima have the
same intensity irrespective of the magnitude of $r$. This will not be
so in practice, because increasing the reflection coefficient of a sur-
face by the deposition in vacuo of a film of silver or aluminium can
only result in a decrease of the transmission coefficients $t$ and $t_1$.
Nevertheless, it is clear that the sharpness of the fringes increases
with the reflection coefficient.

Fig. 4.2. Variation of intensity with phase difference for various reflection coeffi-
cients in fringes obtained on multiple-beam interference in transmitted light
(the parameter is the coefficient r)

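The ratio $I_{max}/I_{min}$ is seen from Equations 4.6 and 4.7 to be

$$\frac{I_{max}}{I_{min}} = \frac{(1+r)^2}{(1-r)^2} \tag{4.8}$$

which is independent of the transmission coefficients of the surfaces.
The visibility $V$ of the fringes is defined as

$$V = \frac{I_{max} - I_{min}}{I_{max} + I_{min}} = \frac{2r}{1 + r^2} \tag{4.9}$$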



and is also independent of the transmission coefficients. For an
uncoated glass surface at normal incidence, $r$ is 0.04. Then

$$V = \frac{2 \times 0.04}{1 + 0.04^2} = 0.08 \text{ approx.}$$

A glass surface coated with pure silver by evaporation in a very
good vacuum can have a reflection coefficient of 0.94 for an opaque
layer. A value of 0.8 is possible with reasonable transmission. The
visibility of the fringes with $r = 0.8$ would be

$$V = \frac{1.6}{1 + 0.64} = 0.975$$

Vacuum-deposited aluminium is preferred to silver, because it 
becomes coated with a tough adherent transparent film of alumi- 
nium oxide in air; its reflection coefficient is about 5% less than for 
silver for white light, but it is greater in the ultra-violet than that of 
silver. 
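The two visibility figures quoted above, and the sharpening of the fringes with increasing reflection coefficient, can be reproduced with a few lines of code. This is a sketch only: it uses $V = 2r/(1+r^2)$ and $I_{max}/I_{min} = (1+r)^2/(1-r)^2$ from Equations 4.6 and 4.7, and the relative intensity $I/I_{max}$ from Equation 4.4; the function names are illustrative.

```python
import math

def visibility(r):
    """Fringe visibility V = 2r / (1 + r^2)."""
    return 2 * r / (1 + r * r)

def contrast(r):
    """Ratio I_max / I_min = (1 + r)^2 / (1 - r)^2."""
    return ((1 + r) / (1 - r)) ** 2

def relative_intensity(r, delta):
    """I / I_max as a function of the phase difference delta (radians)."""
    return 1.0 / (1 + 4 * r / (1 - r) ** 2 * math.sin(delta / 2) ** 2)

for r in (0.04, 0.8, 0.94):
    print(f"r = {r}: V = {visibility(r):.4f}, I_max/I_min = {contrast(r):.1f}")

# Sharpness: fraction of the peak intensity remaining a tenth of an order from a maximum.
for r in (0.04, 0.8, 0.94):
    print(f"r = {r}: I/I_max at delta = 0.2*pi is {relative_intensity(r, 0.2 * math.pi):.4f}")
```

The second loop shows the point made in the text: at the same distance from a maximum, the intensity has fallen much further for the highly reflecting surfaces.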



4.2 The Fabry-Perot Etalon and Interferometer 

The Fabry-Perot etalon consists of two plates of glass or quartz 
maintained in a holder with their facing accurately optically flat 
surfaces parallel and coated with a partially transmitting film of 
silver or aluminium having a reflection coefficient of about 0.8.

There are two main types of etalon : one type has a fixed separa- 
tion (nominally 1 cm is frequently used, though other distances are 
available) between the reflecting surfaces on circular glass or silica 
plates separated by an accurately worked collar of fused silica ; in the 
other arrangement, one plate is fixed and the other is movable on 
a precision micrometer screw, so that the separation can be varied 
whilst maintaining the reflecting surfaces accurately plane-parallel. 

The Fabry-Perot etalon with a fixed separation has its inner plane 
parallel surfaces optically polished to be flat to $\lambda/50$ or even $\lambda/100$,
where $\lambda$ is the wavelength of the light used. The separator is a collar
of silica with three optically polished feet spaced at 120° intervals 
around each of the circular edges [Fig. 4.3(a)]. The planes decided 
by each set of three feet are made accurately parallel. The reflecting 
circular glass or quartz plates are pressed in a metal holder by ap- 
propriate springs [Fig. 4.3(b)] which are brought to bear one on each 
side of the parallel planes decided by the feet on the collar. The final 
adjustment of the plates to be accurately plane-parallel is achieved 
by rotation by hand of screws mounted with respect to the springs 
and bearing members. A monochromatic source of light is observed 
by eye through the etalon. If the eye is focused at infinity (a
condition achievable by practice), concentric circular fringes are
discerned. A telescope can be used to aid the eye in this observation.
Exact parallelism is achieved when a lateral movement of the eye
(or of the etalon) does not cause the fringes to open out or close in.
The screws on the etalon mount are adjusted until this condition
is obtained.

Fig. 4.3. The Fabry-Perot etalon with a fixed separation: (a) plan view of silica
collar; (b) metal holder; (c) arrangement of plates (angle of wedge greatly
exaggerated)

The outer unsilvered surfaces of the plates of the etalon are not 
parallel to one another but set so that the angle between them is 
from 1' to 10' [Fig. 4.3(c)]. This slight wedge shape given to each 
of the plates reduces the effect of the unwanted interference pat- 
tern which would be formed by these unsilvered outer faces. The 
plane-parallel 'slab' in which multiple-beam interference occurs is 
of air between the reflecting surfaces. The refractive index n is there- 
fore very nearly unity; in very high precision work, the etalon space 
can be evacuated to ensure that n is exactly unity. 

When two or more components of different wavelengths of
a light source are observed by the Fabry-Perot etalon, equal-inclination
concentric circular interference fringes due to each compo-
nent are separately produced and are distinct from one another.

The optical path difference between neighbouring transmitted 
beams through the etalon is given by Equation 4.1 where n = 1. 
Multiple-beam interference in the transmitted parallel light occurs 
when these beams are brought to a focus by a lens, or on the retina 
of the relaxed normal human eye. Constructive interference will
occur between neighbouring rays when

$$2d\cos R = p\lambda \tag{4.10}$$

where $p$ is an integer, $\lambda$ is the wavelength of the light and $d$ the axial
separation between the etalon plates.

As the refractive index $n = 1$, the angle of refraction in the air
between the plates will be the same as the angle of incidence $\theta$ and
the angle of the emergent ray to the normal, presuming that the
small difference between the refractive indices of air and a vacuum
is ignored. So the condition

$$2d\cos\theta = p\lambda \tag{4.11}$$

gives constructive interference.

The Fabry-Perot etalon is set up so that monochromatic light
from an extended source S (Fig. 4.4) is rendered parallel by a converging
lens $L_1$, traverses the etalon, and is then focused on to a screen
Z by a second converging lens $L_2$. In practice, S is frequently the
illuminated slit of a spectrograph and $L_1$ the collimator lens; this is
followed by the etalon, then a prism (usually of the constant-deviation
type, Section 5.17) and finally a camera lens $L_2$ and the
screen or photographic plate Z. The prism separates components
of the incident light, but with limited resolving power; the multiple-beam
interference in the etalon enhances the effective resolution
enormously.

Considering first the extended source S (Fig. 4.4), it is seen that
light from an axial point O on the source S will give an angle of
incidence $\theta = 0^\circ$ at the etalon. At such normal incidence, $\cos\theta = 1$
and Equation 4.11 becomes

$$2d = p_0\lambda \tag{4.12}$$

where $p_0$ is the order of interference for normal incidence.

Fig. 4.4. With an extended source, parallel light incident upon a Fabry-Perot
etalon produces focused concentric circular interference fringes at a screen

For any other point such as P on the extended source S, the angle
at which the collimated light is incident upon the etalon has a finite
value $\theta$. Now Equation 4.11 prevails and, for a given $\lambda$, constructive
interference to give a maximum occurs when the order of interference
$p$ is an integer, which is clearly less than $p_0$.

A given value of $\theta$ is obtained for points on a circle about the
centre O of the extended source. The order of interference $p$ will be
a constant for the angle $\theta$ of emergence to the screen Z. The corresponding
bright fringe (for constructive interference with $p$ as an
integer) will hence be a circle about the centre Q at the screen Z.

A concentric series of such circular bright fringes will correspond
to $p_0$ at the centre (or very near the centre), $p_0 - 1$ for the next fringe,
$p_0 - 2$ for the next further out, and so on. At the exact centre, where
incidence is precisely normal, the order $2d/\lambda$ is not necessarily an integer because
$2d$ is not necessarily an exactly integral number of wavelengths.
Hence $p_0$ will be the order of interference for a bright ring very near
the centre where $\theta$ is very small (and so $\cos\theta = 1$, with very small
error). At the centre proper, strictly speaking the order of interference
is $p_0 + \varepsilon$, where $\varepsilon$ is a fraction; at this true centre there will
therefore not, in general, be either a maximum or a minimum.

The bright rings will be separated by dark circular fringes where
destructive interference occurs and

$$2d\cos\theta = (p + \tfrac{1}{2})\lambda \tag{4.13}$$
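The ring conditions above can be turned into numbers with a short sketch. The Python below is illustrative only: the function name `bright_ring_angles` and the chosen separation (1.00001 cm) and wavelength (5,000 Å) are assumptions made for the example, not values from the text. It takes the largest integral order not exceeding $2d/\lambda$ as the innermost bright ring and applies $2d\cos\theta = p\lambda$ to each successive order.

```python
import math

def bright_ring_angles(d_cm, wavelength_cm, n_rings=5):
    """Return (order, angle in radians) for the innermost bright rings.

    Uses 2 d cos(theta) = p * wavelength (Equation 4.11) with n = 1.
    """
    order_at_centre = 2.0 * d_cm / wavelength_cm   # generally not an integer
    p_first = math.floor(order_at_centre)          # order of the first bright ring
    rings = []
    for k in range(n_rings):
        p = p_first - k                            # order falls by one per ring outwards
        cos_theta = min(1.0, p * wavelength_cm / (2.0 * d_cm))
        rings.append((p, math.acos(cos_theta)))
    return rings

# Assumed illustrative values: d = 1.00001 cm, wavelength = 5,000 angstroms = 5e-5 cm
for order, theta in bright_ring_angles(1.00001, 5.0e-5):
    print(order, round(math.degrees(theta), 4))
```

The order decreases by one for each ring outwards, so the printed angles increase from the centre of the pattern.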

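Differentiation of $p_0$ in Equation 4.12 with respect to $\lambda$ gives

$$\frac{\mathrm{d}p_0}{\mathrm{d}\lambda} = \frac{\mathrm{d}}{\mathrm{d}\lambda}\left(\frac{2d}{\lambda}\right) = -\frac{2d}{\lambda^2}$$

Therefore, considering magnitudes only,

$$\frac{\lambda^2}{\mathrm{d}\lambda} = \frac{2d}{\mathrm{d}p_0}$$

or

$$\frac{\lambda}{\mathrm{d}\lambda} = \frac{2d}{\lambda\,\mathrm{d}p_0} = \frac{p_0}{\mathrm{d}p_0} \tag{4.14}$$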



The chromatic resolving power $\lambda/\mathrm{d}\lambda$ of the Fabry-Perot etalon
depends on the smallest change of wavelength $\mathrm{d}\lambda$ which can be
detected at a mean wavelength $\lambda$. From Equation 4.14, it is seen
that this resolving power at the centre of the interference pattern is
given by the smallest change of order $p_0$ which can be recognised.






This will depend upon the sharpness of the circular interference
fringes. From Equation 4.4,

$$I = \frac{I_{\max}}{1 + [4r/(1-r)^2]\sin^2(\delta/2)} \tag{4.15}$$

where $I_{\max}$ is the intensity at the central maximum in the concentric
interference ring pattern (order $p_0$), and $I$ is the intensity at a neighbouring
position for light differing in phase by $\delta$.

Fig. 4.5. The Rayleigh criterion for chromatic resolution (intensity plotted against phase difference $\delta$)

According to the Rayleigh criterion (Chapter 12 of Volume 1), the
two maxima are resolved if the intensity at the midway point between
two maxima is $8/\pi^2$ of the maximum intensity; the intensity
contributed by one bright fringe then only falls off to $4/\pi^2$ at this
midway point (Fig. 4.5). This criterion, as given in Volume 1
for a prism or diffraction grating, does not strictly apply to the
Fabry-Perot etalon because the peaks of intensity formed are considerably
sharper. Nevertheless, a simple but pessimistic estimate of the
resolving power is given by putting $I/I_{\max} = 4/\pi^2 = 0.405$ in Equation 4.15:



$$0.405 = \frac{1}{1 + F\sin^2(\delta/2)}$$

where $F = 4r/(1-r)^2$ is known as the coefficient of finesse. Therefore

$$0.405 + 0.405F\sin^2(\delta/2) = 1$$

Thus

$$\sin^2(\delta/2) = \frac{0.595}{0.405F} = \frac{1.469}{F}$$

or

$$\delta = 2\sin^{-1}\sqrt{\frac{1.469}{F}}$$

This phase difference $\delta$ corresponds to CD in Fig. 4.5. The phase
difference between neighbouring peaks A and B (i.e. CE) which can
be recognised will be

$$2\delta = 4\sin^{-1}\sqrt{\frac{1.469}{F}}$$

As a change of order by unity corresponds to a phase change of $2\pi$,
the smallest change $\mathrm{d}p_0$ of order $p_0$ that can be recognised is therefore
given by

$$\mathrm{d}p_0 = \frac{4\sin^{-1}\sqrt{1.469/F}}{2\pi}$$

From Equation 4.14 the resolving power is therefore given by

$$\frac{\lambda}{\mathrm{d}\lambda} = \frac{p_0}{\mathrm{d}p_0} = \frac{\pi p_0}{2\sin^{-1}(1.21/\sqrt{F})} \tag{4.16}$$

With a reflection coefficient $r$ of 0.8,

$$F = \frac{4r}{(1-r)^2} = \frac{3.2}{0.04} = 80$$

Therefore $\sqrt{F} = 8.95$ and

$$\sin^{-1}\frac{1.21}{\sqrt{F}} = \sin^{-1}\frac{1.21}{8.95} = \sin^{-1}0.135$$

This is small enough for the angle (in radians) to be taken instead
of the sine. So Equation 4.16 can be approximated to

$$\text{resolving power} = \frac{p_0\pi\sqrt{F}}{2\times1.21}$$

Substituting $F = 4r/(1-r)^2$ gives the resolving power as

$$\text{resolving power} = \frac{\pi p_0\sqrt{r}}{1.21(1-r)} = \frac{2.6\,p_0\sqrt{r}}{1-r} \tag{4.17}$$

It is seen that the resolving power is directly proportional to the
maximum order of interference $p_0$ and depends on the reflection
coefficient $r$ of the surface films. An etalon of large separation $d$
and highly reflecting surfaces is therefore required for high resolution.






If the etalon separation $d$ is 1.0 cm and the reflection coefficient
of each surface is 0.8, then for light of wavelength 5,000 Å,

$$p_0 = \frac{2d}{\lambda} = \frac{2}{5\times10^{-5}} = 4\times10^{4}$$

The resolving power is given by Equation 4.17 to be

$$\frac{\lambda}{\mathrm{d}\lambda} = \frac{2.6\times4\times10^{4}\times\sqrt{0.8}}{0.2} = 4.65\times10^{5}$$

At $\lambda$ = 5,000 Å, therefore, it is possible to distinguish a change of
wavelength $\mathrm{d}\lambda$ equal to $5{,}000/(4.65\times10^{5})$ = 0.0108 Å. If an etalon
of 10 cm separation were used, ten times this resolving power would
be obtained, and wavelengths separated by only 0.00108 Å in
5,000 Å would be discerned. Measurements of the hyperfine structure
of spectral lines can therefore be made. Such hyperfine structure
determinations can be used to study the effects on atomic
spectra of the Zeeman effect, the presence of isotopes, nuclear spin
and nuclear quadrupole moments.
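As a check on the arithmetic, the resolving power can be evaluated directly. The sketch below is illustrative: the function names are assumptions, and it simply codes Equation 4.16 (with the inverse sine retained) and Equation 4.17 (the small-angle form) so that the two can be compared for the 1.0 cm etalon with $r = 0.8$ at 5,000 Å.

```python
import math

def resolving_power_exact(p0, r):
    """Equation 4.16: pi * p0 / (2 * asin(1.21 / sqrt(F))), with F = 4r / (1 - r)^2."""
    finesse_coeff = 4.0 * r / (1.0 - r) ** 2
    return math.pi * p0 / (2.0 * math.asin(1.21 / math.sqrt(finesse_coeff)))

def resolving_power_approx(p0, r):
    """Equation 4.17: 2.6 * p0 * sqrt(r) / (1 - r)."""
    return 2.6 * p0 * math.sqrt(r) / (1.0 - r)

d_cm = 1.0                 # etalon separation
wavelength_cm = 5.0e-5     # 5,000 angstroms
p0 = 2.0 * d_cm / wavelength_cm

approx = resolving_power_approx(p0, 0.8)
exact = resolving_power_exact(p0, 0.8)
print("Equation 4.17:", round(approx))
print("Equation 4.16:", round(exact))
print("smallest resolvable change (angstroms):", 5000.0 / approx)
```

The two forms agree to within about half of one per cent at this reflection coefficient, which is why the small-angle approximation is adequate here.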

If the etalon is used on a spectrograph between the prism and
the collimator lens, the extended source of light will be a slit S.
The prism of the spectrograph will produce separated lines at
the camera plate, each line corresponding to one of the component
wavelengths in the source; each line will be crossed by interference
bands due to the multiple-beam interference in the Fabry-Perot
etalon (Fig. 4.6).



Fig. 4.6. Fabry-Perot etalon between prism and collimator lens of a spectrograph:
(a) plan view; (b) appearance of photographic negative

[Figure labels: source of light having three main components; images of slit S for
light of wavelengths $\lambda_1$, $\lambda_2$ and $\lambda_3$; fine structure of lines determined from
interference ring diameters]




4.3 The Method of Exact Fractions 

If the separation $d$ of the reflecting surfaces of a Fabry-Perot etalon
is known approximately, for example to be 1.018 cm, and the
wavelengths of three spectral lines are known accurately, the separation
$d$ can be determined to a very high degree of accuracy by
the Benoit method of exact fractions; other wavelengths can then
be measured to a similar high degree of accuracy.

For any wavelength $\lambda_1$, the order of interference at the centre
of the Fabry-Perot fringes is not usually an integer so can be
expressed as $p_1 + \varepsilon_1$, where $p_1$ is the order for the first bright ring,
$p_1$ being an integer and $\varepsilon_1$ being a fraction. For other wavelengths
$\lambda_2, \lambda_3, \lambda_4, \ldots$ the corresponding values are $p_2, p_3, p_4, \ldots$ for the
integers and $\varepsilon_2, \varepsilon_3, \varepsilon_4, \ldots$ for the fractions. As $\cos\theta$ in Equation
4.11 is unity in each case,

$$2d = \lambda_1(p_1 + \varepsilon_1) = \lambda_2(p_2 + \varepsilon_2) = \lambda_3(p_3 + \varepsilon_3) \tag{4.18}$$

To measure these fractions $\varepsilon_1$, $\varepsilon_2$ and $\varepsilon_3$ at the centre of the
fringe systems, the etalon is placed between the prism and the
collimator lens of a prism spectrometer. If $j$ is the bright ring number
(going out from the centre of the pattern) for a given wavelength
and $p$ is the order for the first bright ring, the order for the
$j$th ring is $p - j + 1$. At this $j$th ring, $\theta$ will not be zero but equal to
$\theta_j$, say. From Equation 4.11, therefore,

$$2d\cos\theta_j = (p - j + 1)\lambda$$

where $\lambda$ will be $\lambda_1$, $\lambda_2$ or $\lambda_3$ for $\varepsilon_1$, $\varepsilon_2$ or $\varepsilon_3$ respectively. Substituting

$$2d = (p + \varepsilon)\lambda$$

gives

$$(p + \varepsilon)\cos\theta_j = p - j + 1 \tag{4.19}$$

As $\theta_j$ is small,

$$\cos\theta_j = 1 - \tfrac{1}{2}\theta_j^2$$

The appearance in the telescope eyepiece or of the developed 
image on a photographic plate will be of a bright line, correspon- 
ding to a particular wavelength, crossed by the concentric circular 
interference fringes. The diameters of the bright rings at a given 
wavelength can be found by measurements with a travelling micro- 
scope on the photographic negative (Fig. 4.7). 

Fig. 4.7. An image of the slit for a particular wavelength is crossed by fringes
which are short sections of concentric circular interference fringes

If $D_j$ is the diameter of the $j$th ring for a given wavelength,

$$KD_j = 2\theta_j$$

where $K$ is a constant depending on the magnification produced
by the camera lens. Therefore

$$\cos\theta_j = 1 - \tfrac{1}{8}K^2D_j^2 \tag{4.20}$$

From Equations 4.19 and 4.20 it follows that

$$(p + \varepsilon)\left(1 - \tfrac{1}{8}K^2D_j^2\right) = p - j + 1$$

Therefore

$$\tfrac{1}{8}(p + \varepsilon)K^2D_j^2 = j + \varepsilon - 1 \tag{4.21}$$

When $D_j = 0$, $j + \varepsilon - 1 = 0$ so $j = 1 - \varepsilon$. Furthermore, a graph
of $D_j^2$ against the ring number $j$ is a straight line (Fig. 4.8) for
which the intercept on the $j$-axis is $1 - \varepsilon$. The fraction $\varepsilon$ is therefore
determined for a particular wavelength $\lambda$. For accurately known
wavelengths $\lambda_1$, $\lambda_2$ and $\lambda_3$, the fractions $\varepsilon_1$, $\varepsilon_2$ and $\varepsilon_3$ are found
in this manner, each to an accuracy of ±0.01.

Fig. 4.8. Plot of the square of the diameter of a bright interference ring ($D_j^2$)
against the ring number $j$
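The graphical step can be sketched numerically. The Python below is an illustration under stated assumptions: the function name `fraction_from_rings` is hypothetical, and the ring diameters are synthetic values generated from an assumed fraction of 0.30, not measurements from the text. It fits a least-squares straight line to $D_j^2$ against $j$ and reads off the intercept on the $j$-axis, which Equation 4.21 identifies with $1 - \varepsilon$.

```python
import math

def fraction_from_rings(ring_numbers, diameters):
    """Fit D^2 = slope * j + c by least squares and return epsilon.

    The line crosses D^2 = 0 at j = 1 - epsilon (Equation 4.21).
    """
    xs = [float(j) for j in ring_numbers]
    ys = [D * D for D in diameters]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept_y = mean_y - slope * mean_x
    j_at_zero = -intercept_y / slope      # intercept on the j-axis
    return 1.0 - j_at_zero

# Synthetic diameters (arbitrary units) built from an assumed epsilon of 0.30
rings = [1, 2, 3, 4, 5]
diameters = [math.sqrt(2.0 * (j + 0.30 - 1.0)) for j in rings]
epsilon = fraction_from_rings(rings, diameters)
print(round(epsilon, 3))
```

With real measurements the points scatter about the line, and the fit averages over all the measured rings rather than relying on a single diameter.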

For the separation $d$ of, say, nominally 1.018 cm, $2d/\lambda_1$ gives the
integer $p_1$ approximately. At this wavelength, $\varepsilon_1$ is known. Hence
$\lambda_1(p_1 + \varepsilon_1)$ in Equation 4.18 is apparently known. As $\lambda_2$ is also
known, it is possible to calculate $p_2 + \varepsilon_2$ from the relationship

$$\lambda_1(p_1 + \varepsilon_1) = \lambda_2(p_2 + \varepsilon_2)$$

The fraction $\varepsilon_2$ is then found because $p_2$ is necessarily integral;
if this calculated value of $\varepsilon_2$ is different from that obtained experimentally
(i.e. leading to a plot such as Fig. 4.8), the integral value
of $p_1$ must be wrong. The procedure is then to find the correct
value of the integer $p_1$ by exploring the range of values differing
successively by $+1, +2, +3, \ldots, +p'$ and $-1, -2, -3, \ldots, -p'$
from the originally calculated value, where $p'$ is an integer of
20-30 usually. Each time the corresponding value of $\varepsilon_2$ is calculated.
When this value is the same as that determined experimentally at
the wavelength $\lambda_2$, then the correct integer $p_1$ is known. To make
sure, a third wavelength $\lambda_3$ is also explored in the same way.
Now $p_1 + \varepsilon_1$ is known precisely for the known wavelength $\lambda_1$,
where $p_1$ is an exactly known integer and the fraction $\varepsilon_1$ is known to
within 0.01. From the equation

$$2d = \lambda_1(p_1 + \varepsilon_1)$$

the separation $d$ is thus found to within about $10^{-7}$ cm.

For example, if $d$ is nominally 1.018 cm and $\lambda_1$ is exactly 5,000 Å,
$p_1$ is initially estimated to be $2.036/(5\times10^{-5})$ = 40,720. If it is
eventually found by the procedure given that $p_1$ is, in fact, 40,728
whilst $\varepsilon_1$ is obtained by experiment to be 0.03, it follows that
$p_1 + \varepsilon_1$ is 40,728.03, and the exact length of the etalon is thus

$$d = \tfrac{1}{2}\times5\times10^{-5}\times40{,}728.03 = 1.01820075\ \text{cm}$$

to an accuracy of better than 1 part in $10^{7}$.
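The search over trial integers lends itself to a short program. The following Python is a minimal sketch, not the historical procedure in detail: the function names are hypothetical, the two additional wavelengths are assumed illustrative values, and the "measured" fractions 0.66 and 0.81 are synthetic numbers chosen to be consistent with the etalon length of the worked example. Each trial $p_1$ within ±30 of the estimate is kept only if Equation 4.18 reproduces the fractions at the other wavelengths to within a tolerance of 0.02.

```python
def circular_difference(a, b):
    """Distance between two fractions, treating 0.99 and 0.01 as close."""
    diff = abs(a - b) % 1.0
    return min(diff, 1.0 - diff)

def exact_fractions(p1_estimate, lam1, eps1, others, search=30, tol=0.02):
    """Return every integer p1 near the estimate that fits all fractions.

    others: list of (wavelength, measured_fraction) pairs for the
    remaining known lines; all wavelengths in the same unit as lam1.
    """
    matches = []
    for p1 in range(p1_estimate - search, p1_estimate + search + 1):
        two_d = lam1 * (p1 + eps1)                      # Equation 4.18
        if all(circular_difference((two_d / lam) % 1.0, eps) <= tol
               for lam, eps in others):
            matches.append(p1)
    return matches

lam1 = 5.0e-5                                           # cm, as in the worked example
others = [(6.4384696e-5, 0.66), (4.799912e-5, 0.81)]    # assumed lines, synthetic fractions
matches = exact_fractions(40720, lam1, 0.03, others)
d = 0.5 * lam1 * (matches[0] + 0.03)
print(matches, d)
```

With only one extra wavelength several trial integers can survive by accident, which is why a third line is used as a check; requiring both fractions to agree leaves a single candidate in this example.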

Once the etalon separation d is found to this order of accuracy 
by use of the method of exact fractions, it is possible to determine 
with the same precision other wavelengths previously known only 
approximately. 



4.4 The Fabry-Perot Etalon with Variable Separation 

One accurately plane glass plate coated with silver or aluminium
is fixed and the other is mounted on a nut on a micrometer screw
so that it can be moved to alter the separation $d$. A convenient
variable-separation etalon can be set up by removing the two fully
reflecting mirrors from a Michelson interferometer (Volume 1,
Chapter 10) and setting up two partially reflecting mirrors parallel
to each other with their silvered surfaces facing; one of these
partially reflecting mirrors is fixed and the other is on the movable
carriage controlled by the micrometer screw. Indeed, some commercial
instruments are arranged so that they can be set up either
as a Michelson interferometer or as a variable-thickness Fabry-Perot
etalon.

The observer can view by eye, or with the aid of a telescope, 
a source of light directly through the two silvered plates and 
study the positions of the concentric circular interference fringes 
as the separation between the plates (i.e. the thickness or length of 
the etalon) is varied. 

Suppose that there are two components, of wavelengths $\lambda_1$ and
$\lambda_2$, in the light from the source and that $\lambda_1 > \lambda_2$. When the reflecting
surfaces are nearly in contact (separation $d$ is very small), the ring
systems due to the two different wavelengths will almost coincide.
As the separation $d$ is increased, the rings separate. It is possible
to judge when one set of rings is half-way between those of the
other set. As $d$ is then further increased, a position will be reached
where the two ring systems coincide exactly, then separate out
beyond this position to give a second half-way position and a
second coincidence, and so on as $d$ is increased progressively.

Let the separation be $d_1$ when the bright rings (i.e. the maxima)
coincide in a direction near the centre of the field of view where $\theta$
is small; then

$$2d_1\cos\theta = p_1\lambda_1 = q_1\lambda_2$$

Therefore

$$\frac{\lambda_1}{\lambda_2} = \frac{q_1}{p_1}$$

where $p_1$ and $q_1$ are the respective integral orders of interference.

At the next bright ring coincidence further from the centre, for
the same separation $d_1$ but where $\theta$ has increased, $p_1$ becomes
$p_1 - 1$ and $q_1$ becomes $q_1 - 1$; but $(q_1 - 1)/(p_1 - 1)$ is very nearly
equal to $q_1/p_1$ because $p_1$ and $q_1$ are both very large numbers. Consequently,
the coincidence of bright rings will be observed over the
whole field of view.

The plate separation is then increased to $d_2$ at which the bright
ring maxima due to $\lambda_1$ and $\lambda_2$ again coincide. This is achieved
when $p_1$ is increased to $p_2$ and $q_1$ to $q_2$, so

$$2d_2\cos\theta = p_2\lambda_1 = q_2\lambda_2$$



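Therefore

$$2(d_2 - d_1)\cos\theta = (p_2 - p_1)\lambda_1 = (q_2 - q_1)\lambda_2$$

Clearly, if $p_2 - p_1 = m$, $q_2 - q_1$ must equal $m + 1$, where $m$ is an
integer, because $\lambda_1 > \lambda_2$ and the coincidence at $d_2$ is the next one
after that at $d_1$. Therefore

$$2(d_2 - d_1)\cos\theta = m\lambda_1 = (m + 1)\lambda_2 = m\lambda_2 + \lambda_2$$

Therefore

$$\lambda_1 - \lambda_2 = \frac{\lambda_2}{m} \tag{4.22}$$

As $m = p_2 - p_1 = 2(d_2 - d_1)/\lambda_1$ near the centre of the fringe system
when $\theta = 0$ and $\cos\theta = 1$,

$$\lambda_1 - \lambda_2 = \frac{\lambda_1\lambda_2}{2(d_2 - d_1)} \tag{4.23}$$

If $\lambda_1 - \lambda_2 = \mathrm{d}\lambda$, where $\mathrm{d}\lambda$ is very small, which is frequently the
situation of interest, $\lambda_1\lambda_2$ can be replaced by $\lambda^2$ and Equation 4.23 becomes

$$\mathrm{d}\lambda = \frac{\lambda^2}{2(d_2 - d_1)} \tag{4.24}$$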

The possible procedures in practice are: 

1. The number of fringes at wavelength $\lambda_1$ or $\lambda_2$ which appear
from the centre of the field of view (for example, which pass
the cross-wires in a viewing telescope eyepiece) is counted as
the movable plate is moved from one coincidence of bright rings
to the next over the distance $d_1$ to $d_2$. This number is $m$ for
$\lambda_1$ (or $m + 1$ for $\lambda_2$) in Equation 4.22, so $\lambda_1/\lambda_2$ is known,
or $\lambda_2$ is found if $\lambda_1$ is known.

2. If the micrometer screw is accurately calibrated in centimetres,
the traverse $d_2 - d_1$ is directly measured and
Equation 4.23 gives $\lambda_1 - \lambda_2$ accurately, provided that $\lambda_1$ and
$\lambda_2$ are known approximately.
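The second procedure can be sketched as a short calculation. The Python below is illustrative only: the function names are hypothetical, and the wavelengths (5,896 Å and 5,890 Å, roughly those of the sodium D lines) and the traverse of 0.0289 cm are assumed example values, not data from the text. It applies the relation obtained by combining Equation 4.22 with $m = 2(d_2 - d_1)/\lambda_1$, namely $\lambda_1 - \lambda_2 = \lambda_1\lambda_2/2(d_2 - d_1)$.

```python
def wavelength_difference(lam1_cm, lam2_cm, traverse_cm):
    """lambda1 - lambda2 = lambda1 * lambda2 / (2 * (d2 - d1)), all lengths in cm."""
    return lam1_cm * lam2_cm / (2.0 * traverse_cm)

def fringes_between_coincidences(lam1_cm, traverse_cm):
    """m = 2 * (d2 - d1) / lambda1: fringes of lambda1 passing the centre."""
    return 2.0 * traverse_cm / lam1_cm

# Assumed example: approximate wavelengths and a measured traverse between coincidences
lam1, lam2 = 5.896e-5, 5.890e-5          # cm
traverse = 0.0289                        # cm, d2 - d1 read from the micrometer
delta = wavelength_difference(lam1, lam2, traverse)
print("difference in angstroms:", round(delta * 1.0e8, 2))
print("fringes counted:", round(fringes_between_coincidences(lam1, traverse)))
```

Only approximate values of the two wavelengths are needed, because their product changes very little compared with the accuracy of the measured traverse.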



4.5 Brewster's Fringes: Two Fabry-Perot Etalons in Series 

As has already been shown, with a single parallel plate a given
interference fringe corresponds to a given value of $\cos\theta$; thus
constant inclination fringes are formed which are at infinity and
can be focused by an objective lens to give concentric alternately
bright and dark rings in the focal plane of the lens. With two
such parallel plates inclined at a small angle $2\alpha$ to each other,
Brewster's fringes are obtained in the focal plane of an objective
lens when an extended source of light is viewed by transmission.






Suppose the two plates are of the same thickness d and have 
semi-silvered surfaces. In the case of chief interest, these 'plates' 
both consist of the air enclosed between the facing semi-silvered 
surfaces of an etalon; thus two etalons in series are involved, 
and the refractive index n of the medium is unity (Fig. 4.9).* 




Fig. 4.9. Brewster fringes formed on the transmission of light through two etalons
in series and inclined at a small angle to each other: (a) plan view of etalons;
(b) vertical elevation



Consider first horizontal rays such as a and b parallel to the
axis (which is best defined as the axis of a viewing telescope, of
which lens L is the objective). The etalons 1 and 2 are inclined at
the small angle $2\alpha$ and are symmetrically disposed about the
normal to the axis in the plan view shown in Fig. 4.9(a). The
angles of incidence of both rays a and b at etalons 1 and 2 will
then be $\alpha$.

The ray a has an additional path of $2d\cos\alpha$ in etalon 2, and the
ray b has an additional path of $2d\cos\alpha$ in etalon 1. All such
parallel rays can be divided into pairs, one of type a and the other
of type b. In general, for a ray of type a there will be an even
number $m$ of reflections in etalon 1 and $m + 2$ reflections in etalon 2;
for a ray of type b, there will be $m + 2$ reflections in etalon 1 and
$m$ in etalon 2. In the case illustrated in Fig. 4.9(a), $m$ is taken to be
zero for simplicity.

For horizontal parallel rays such as c and d inclined at an angle
$\theta$ to the axis, the angle of incidence will be $\theta - \alpha$ at etalon 1 and
$\theta + \alpha$ at etalon 2. For a ray of type c (which corresponds to type a)
the additional path in etalon 2 is now $2d\cos(\theta + \alpha)$, whereas for a
ray of type d (which corresponds to type b) there is a different
additional path of $2d\cos(\theta - \alpha)$. The ray pairs such as c and d
hence arrive at the focal plane of lens L with a phase difference $\delta$
given by

$$\delta = \frac{2\pi}{\lambda}\,2d\,[\cos(\theta - \alpha) - \cos(\theta + \alpha)] = \frac{8\pi d}{\lambda}\sin\theta\sin\alpha$$

As expected, this is zero when $\theta = 0$, i.e. when the incident horizontal
rays are parallel to the axis as for rays a and b.

From the extended source of light S, rays will not, of course,
all be in or near the horizontal. Consider, therefore, the general
case of rays which are inclined at angle $\phi$ to the horizontal and at $\theta$
to the vertical plane through the axis. The phase difference $\Delta$ between
pairs such as c and d in Fig. 4.9(b) (not now in the horizontal
plane containing the axis of the telescope) will be $\delta\cos\phi$,
i.e.

$$\Delta = \delta\cos\phi = \frac{8\pi d}{\lambda}\sin\theta\sin\alpha\cos\phi \tag{4.25}$$

where $\lambda$ is the wavelength of the light from the extended source S.

Incident rays which lie in vertical planes parallel to the axis but
which are at angle $\phi$ to the axis of the telescope will be those for
which $\theta = 0$, so will arrive in phase at the focal plane of the telescope
objective. For any particular value of $\phi$, there will therefore
be a bright point in the focal plane of lens L. For the range of
values of $\phi$ for an extended source, the series of bright points will
form a central vertical bright line in the focal plane. This will be
observed both for monochromatic light and also for white light
because $\Delta = 0$.

Incident rays for which $\theta$ is finite but with different values of $\phi$
will correspondingly be focused in a vertical line in the focal
plane of lens L; however, there will now be a specific phase difference
$\Delta$ between pairs of rays, depending on both $\theta$ and $\phi$. In the
usual practical situation, $\phi$ is small and $\cos\phi \approx 1$, so that variation
with $\phi$ is not significant. The focal plane of lens L, i.e. the field of
view in a telescope of which L is the objective, will therefore be
illuminated by a series of vertical interference fringes, known as
Brewster's fringes.




Maxima, i.e. bright vertical lines, will appear in this system of
interference fringes when $\Delta = 2p\pi$ or zero, where $p$ is an integer.
As $\phi$ is small in Equation 4.25, $\cos\phi$ can be put equal to unity;
thus maxima occur when

$$\frac{8\pi d}{\lambda}\sin\theta\sin\alpha = 2p\pi \text{ or zero} \tag{4.26}$$

If the angles $\theta$ and $\alpha$ are also small, as is usual with Fabry-Perot
etalons,

$$\theta = \frac{p\lambda}{4d\alpha}$$

The angular separation of the maxima (bright line fringes) is therefore
$\lambda/4d\alpha$.
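A short numerical sketch can confirm that the small-angle spacing agrees with the full phase expression. In the Python below the function names and the example values ($d$ = 1 cm, $\lambda$ = 5,000 Å, $\alpha$ = 0.001 rad) are assumptions chosen for illustration. The first function evaluates the phase difference of Equation 4.25 and the second the angular separation $\lambda/4d\alpha$; the phase at one fringe spacing from the axis should then be very nearly $2\pi$.

```python
import math

def brewster_phase(d_cm, wavelength_cm, theta, alpha, phi=0.0):
    """Equation 4.25: (8 pi d / lambda) sin(theta) sin(alpha) cos(phi)."""
    return (8.0 * math.pi * d_cm / wavelength_cm
            * math.sin(theta) * math.sin(alpha) * math.cos(phi))

def fringe_spacing(d_cm, wavelength_cm, alpha):
    """Small-angle angular separation of neighbouring maxima: lambda / (4 d alpha)."""
    return wavelength_cm / (4.0 * d_cm * alpha)

# Assumed example values: etalon length 1 cm, 5,000 angstroms, half-angle 0.001 rad
spacing = fringe_spacing(1.0, 5.0e-5, 1.0e-3)
phase = brewster_phase(1.0, 5.0e-5, spacing, 1.0e-3)
print("angular spacing (rad):", spacing)
print("phase at first maximum / 2 pi:", phase / (2.0 * math.pi))
```

The spacing is inversely proportional to both the etalon length and the inclination, so longer etalons require a smaller angle between them if the fringes are to remain comfortably separated in the telescope.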

To summarise, Brewster's fringes are obtained on viewing an 
extended source through two equal parallel plates in series, inclined 
at a small angle to one another. As observed in a telescope, these 
fringes consist of parallel bright lines interleaved by dark fringes 
which are vertical when the angle of inclination between the plates 
is in the horizontal plane. If the extended source of light is white, 
a central maximum bright white line flanked by coloured fringes is 
observed. With monochromatic light, a series of alternately light 
and dark fringes is observed for low orders of interference. At high 
orders, the fringes become curved. 

With two Fabry-Perot etalons in series, the 'plate' or 'slab'
is now simply of air. Brewster's fringes are seen if these two
etalons are of the same thickness $d$, and also if the ratio of their
thicknesses $d_1$ and $d_2$ is 2 : 1 or, in general, where $d_1/d_2$ equals the
ratio of small integers.



4.6 Comparison of the Standard Metre with the Wavelength of 
Light by means of Fabry-Perot Etalons 

In Volume 1 (Section 10.3) is described the determination of the 
wavelength of the red and other lines in the spectrum of a cadmium 
source in terms of the accepted standard metre. This pioneer work 
made use of the Michelson interferometer. Later work has utilised 
Fabry-Perot etalons. This was first undertaken in 1913 by Benoit, 
Fabry and Perot. More recent work is due to Sears and Barrell,
and at the present time the red cadmium line ($\lambda$ = 6,440 Å) has
been discarded as the standard in favour of that from a discharge
lamp containing the isotope of krypton of mass 86 ($\lambda$ = 6,060 Å),
which exhibits less hyperfine structure than the cadmium red.

To compare wavelengths with mechanical standards of length,
of which the accepted international standard metre is the most
important, Benoit, Fabry and Perot used five Fabry-Perot etalons
(etalons 1, 2, 3, 4 and 5) of lengths (reflecting surface separations)
100 cm, 50 cm, 25 cm, 12.5 cm and 6.25 cm respectively. Sears and
Barrell used only three etalons: etalon 1 of 1 m, etalon 2 of $\tfrac{1}{3}$ m
and etalon 3 of $\tfrac{1}{9}$ m, where these lengths are precise as regards
normal engineering standards but are nominal in relation to high-precision
optical measurements.

In both series of experiments, the length of the shortest etalon 
was determined by the method of exact fractions using the red 
cadmium line (later the orange-red line of krypton 86); the number 
of wavelengths accommodated within the length of the etalon was 
thus found precisely. In the Fabry, Perot and Benoit procedure, 
the first part of the experiment was followed by a second part, 
in which the ratios of the lengths of the following etalon combi- 
nations were found: etalon 4/etalon 5; etalon 3/etalon 4; etalon 
2/etalon 3; and etalon 1/etalon 2. To do this, specially calibrated
thin air wedges between silvered glass plates were used. In the 
Sears and Barrell experiments, the ratios of the lengths of etalons 
2 and 3 and of the lengths of etalons 1 and 2 were determined by 
a method depending on the observation of Brewster fringes 
(Section 4.5). The third and final part of both experiments was then 
to compare the longest etalon (etalon 1) of nominally 100 cm (but 
now of exactly known length in terms of the wavelength of the light
used) with the standard metre. 

In both experiments, the apparatus was set up* so that the obser- 
vation of the order of interference at the centre of the shortest 
etalon, the measurement of the ring diameters needed for the 
subsequent exact fraction calculation, the intercomparisons of the 
etalon lengths and the final comparison between the 100 cm etalon 
and the standard metre could all be made as quickly as possible. 
This was essential to avoid errors due to changes in atmospheric 
conditions. In the experiments of Sears and Barrell, the whole 
optical arrangement was maintained at constant temperature 
measured to an accuracy of 0.001 deg C; each etalon was specially
constructed so that the space between its reflecting surfaces could 
be evacuated to low pressure so as to avoid errors resulting from 
the fact that the refractive index of air is not exactly unity. 

The mean value of several measurements of the wavelength of
the red line in the cadmium spectrum in terms of the standard
metre is 6,438.4696 Å.

In 1960, the International Commission of Weights and Measures
adopted the orange-red line of krypton 86 as the standard and defined
the metre as 1,650,763.73 wavelengths in vacuo of this line.

* For fuller details see, for example: Longhurst, R. S., Geometrical and 
Physical Optics, Longmans, Green, London (1957). 




This line in the spectrum of light from a krypton 86 lamp is there- 
fore the basic standard of length from which the metre is defined 
and, in particular, it is the reference wavelength in terms of which 
all other wavelengths in the electromagnetic spectrum are specified. 

4.7 Interference Filters of the Fabry-Perot Type 

Interference filters which transmit substantially monochromatic 
light with a much sharper cut-off than is possible with gelatine or 
coloured glass filters are made by vacuum-coating techniques. For
example, a semi-transparent film of silver or aluminium is first deposited
on a plane glass plate; on top of this is deposited a layer of
a transparent dielectric such as cryolite, and finally a second semi-transparent
film of silver or aluminium.

If $d$ is the thickness of the dielectric film of refractive index $n$
sandwiched between the reflecting metallic films, a Fabry-Perot
etalon of very short length is produced. When a beam of collimated
light is incident normally, maxima in the transmitted light will occur
for those wavelengths $\lambda$ which satisfy the equation

$$2nd = p\lambda$$

where $p$ is an integer.

Cryolite evaporated in a vacuum will produce a durable transparent
film, of refractive index about 1.34, on a substrate. If its
thickness $d$ is small, say $4\times10^{-5}$ cm, comparable with visible light
wavelengths, then

$$2\times1.34\times4\times10^{-5} = p\lambda$$

For $p = 1$, $\lambda = 10.72\times10^{-5}$ cm = 10,720 Å in the infra-red; for
$p = 2$, $\lambda$ = 5,360 Å in the visible and for $p = 3$ or more, $\lambda$ is in the
ultra-violet region. Consequently, the only visible light transmitted
is at 5,360 Å in the green.
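The pass-bands of such a film follow directly from $2nd = p\lambda$, as the following sketch shows. The function name `transmitted_wavelengths` and the limits of 4,000 Å and 7,000 Å taken for the visible region are assumptions made for the example; the refractive index and thickness are those quoted for the cryolite film.

```python
def transmitted_wavelengths(n, d_cm, max_order=4):
    """Wavelengths in angstroms satisfying 2 n d = p * lambda at normal incidence."""
    return {p: 2.0 * n * d_cm / p * 1.0e8 for p in range(1, max_order + 1)}

peaks = transmitted_wavelengths(1.34, 4.0e-5)

# Assumed limits of the visible region, in angstroms
visible = {p: lam for p, lam in peaks.items() if 4000.0 <= lam <= 7000.0}

for order, lam in peaks.items():
    print(order, round(lam))
print("visible orders:", list(visible))
```

A thicker film would bring several orders into the visible region at once, which is why the dielectric layer is kept comparable in thickness with the wavelength itself.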

An interference filter with green transmission is thus available.
However, the light transmitted will not only be of wavelength
5,360 Å because there will be a distribution of light in wavelengths
about the wavelength at the central maximum to an extent dependent
upon the resolving power of the etalon (Fig. 4.5). From the
analysis given in Section 4.2, it is seen that the intensity falls off
to approximately 40% of that at the central maximum at wavelengths
$\lambda - \mathrm{d}\lambda$ and $\lambda + \mathrm{d}\lambda$ where, from Equation 4.17,

$$\frac{\lambda}{\mathrm{d}\lambda} = \frac{2.6\,p_0\sqrt{r}}{1 - r}$$

For $r = 0.9$, $\sqrt{r} = 0.949$ and

$$\mathrm{d}\lambda = \frac{0.1\,\lambda}{2.6\,p_0\times0.949} = \frac{0.0405\,\lambda}{p_0}$$

For the cryolite film of thickness $4\times10^{-5}$ cm, $\lambda = 5.36\times10^{-5}$ cm
and $p_0 = 2$, as shown above. Therefore,

$$\mathrm{d}\lambda = \frac{0.0405\times5.36\times10^{-5}}{2} = 1.09\times10^{-6}\ \text{cm}$$

Therefore, with a central maximum $I_{\max}$ at 5,360 Å, light of intensity
$0.4\,I_{\max}$ will be transmitted at wavelengths (5,360 ± 109) Å.

The width of the band of wavelengths passed by such an interference
filter can be reduced to about 100 Å by increasing the reflection
coefficient of the metallic surfaces to 0.94. It is then difficult to
obtain sufficient transmitted light because, with metallic films, high
reflectivity can only be obtained with thicker deposits, which absorb
light strongly.

This difficulty can be overcome by using enhanced reflecting 
layers formed by the vacuum deposition of transparent dielectrics 
on to glass instead of reflecting metallic films. In Section 9.4 of 
Volume 1, it is shown that a layer of transparent dielectric $\lambda/4$ thick
on glass will produce enhanced reflection if this dielectric has a refractive
index greater than that of the glass substrate. Suitable high
refractive index dielectrics for this purpose are zinc sulphide, for
which $n = 2.3$, and titanium dioxide of refractive index $n = 2.8$ at
a wavelength 5,000 Å.

Interference filters of low absorption and good transmission of
light can therefore be made by the deposition in vacuo on to a glass
plate (which does not have to be optically flat because the deposited
film follows contours accurately) of first a dielectric, such as titanium
dioxide, to a thickness of $\lambda/4$ (where $\lambda$ is the desired transmitted
wavelength), then a thin layer of cryolite or, alternatively, magnesium
fluoride, and finally a second layer $\lambda/4$ thick of titanium dioxide.

To obtain higher reflection coefficients than are possible with 
only one dielectric film (and so narrow down the band of wavelengths 
transmitted by the filter), dielectric films of alternately high and 
low refractive indices are vacuum deposited. With seven layers of
zinc sulphide and cryolite, the reflection coefficient is 0.9.

The modern interference filter able to transmit a narrow band 
of wavelengths about a predetermined specific wavelength is a valu- 
able device in optics, with considerable advantages in defined band- 
pass over the coloured gelatine filter. In recent developments, the 
vacuum coating procedure is programmed to give several layers 
of dielectric materials of controlled thicknesses. These layers are 
sandwiched between glass plates for protection, and the outer sur- 
faces of the glass plates are bloomed (see Section 9.4 of Volume 1) 
to enhance light transmission and reduce back-reflection. 

Reflection filters are also available: a dielectric layer of, say,
cryolite is vacuum-deposited on to a mirror consisting of, for
example, glass vacuum-coated with aluminium. A semi-transparent
metallic film is then deposited on the dielectric. The mirror so made
can be designed to have zero reflectivity at specific wavelengths.



4.8 The Lummer-Gehrcke Interferometer 

The resolving power of a multiple-beam interferometer increases 
with the reflecting power of the separated surfaces. In 1901, Lum- 
mer devised a method of making an interferometer in which high 
reflection was obtained by utilising light internally reflected near 
the critical angle; in collaboration with Gehrcke, a new interferom- 
eter based on the principle was introduced in 1903. The reflecting 
surfaces did not require silvering or aluminising. 

The Lummer-Gehrcke interferometer consists of an accurately 
plane-parallel plate of glass, or preferably quartz, with a small right- 
angled prism of the same material in optical contact at one end 
(Fig. 4.10). Light incident normally on the upright plane face of 
this prism is reflected at its hypotenuse face. The reflected light 
passes into the plane-parallel plate to be incident internally on its 
surface with air at an angle slightly less than the critical angle. 
The prism angle is predetermined to achieve this. At each successive 
incidence down the length of the plane-parallel plate, light is reflec- 
ted and partially transmitted. The transmitted parallel beams are 
brought to a focus by a lens L. 

The Lummer-Gehrcke plate is an alternative to the Fabry-Perot
etalon, though it is little used nowadays because the latter has 
proved more versatile and is rather easier to construct. Like the 
Fabry-Perot etalon, it is usually set up between the collimator and 
prism on a constant-deviation spectrometer. 

Multiple beam interference fringes of equal inclination are ob- 
tained in the focal plane of lens L. Two sets of identical fringe sys- 
tems are produced symmetrically about the two sides of the plane- 
parallel plate. 



Fig. 4.10. The Lummer-Gehrcke interferometer (labels: right-angled
prism; optical contact; plate of refractive index n; lens L; focal
plane of L; appearance of fringes)



If n is the refractive index of the material of the plate of thickness
d, maxima in the interference pattern occur when

$$2nd\cos\theta = p\lambda \qquad (4.28)$$

p being an integer. Let φ be the angle that the emergent light makes
with the normal; then

$$n = \frac{\sin\phi}{\sin\theta}$$

Substituting for cos θ in Equation 4.28,

$$2nd\sqrt{1 - \frac{\sin^2\phi}{n^2}} = p\lambda$$

Therefore,

$$2d\sqrt{n^2 - \sin^2\phi} = p\lambda \qquad (4.29)$$

or

$$\sin^2\phi = n^2 - \frac{p^2\lambda^2}{4d^2} \qquad (4.30)$$

To explore the variation of φ with order p (where n, d and λ are
constant), differentiation of Equation 4.30 with respect to φ gives

$$2\sin\phi\cos\phi = -\frac{\lambda^2}{4d^2}\,2p\,\frac{dp}{d\phi} = -\frac{p\lambda^2}{2d^2}\,\frac{dp}{d\phi}$$

The angle Δφ between orders differing by Δp is therefore given by

$$\Delta\phi = -\frac{p\lambda^2\,\Delta p}{2d^2\sin 2\phi}$$

where Δp = 1 for successive orders. Substituting for pλ from Equation
4.29, the angle δφ between successive orders is seen to be

$$\delta\phi = -\frac{\lambda\sqrt{n^2 - \sin^2\phi}}{d\sin 2\phi} \qquad (4.31)$$

The separation between successive orders therefore increases with
wavelength, varies inversely with the plate thickness d and increases
as φ → 90° (i.e. as grazing emergence is approached). Note that the
length of the plate does not enter into this analysis: the separation
is independent of this length.
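As a numerical check on this result, the following sketch evaluates the emergence angle of two neighbouring orders directly from Equation 4.30 and compares their difference with the magnitude predicted by Equation 4.31. The plate parameters (thickness 0.5 cm, n = 1.5, wavelength 5 × 10⁻⁵ cm) and the two order numbers are illustrative assumptions, chosen only so that the light emerges near grazing incidence.

```python
import math


def emergence_angle(order, wavelength, thickness, n):
    """Emergence angle phi (radians) for order p, from Equation 4.30."""
    sin_sq = n * n - (order * wavelength / (2.0 * thickness)) ** 2
    if not 0.0 <= sin_sq <= 1.0:
        raise ValueError("this order does not emerge from the plate")
    return math.asin(math.sqrt(sin_sq))


def order_separation(phi, wavelength, thickness, n):
    """Magnitude of the angle between successive orders, from Equation 4.31."""
    root = math.sqrt(n * n - math.sin(phi) ** 2)
    return wavelength * root / (thickness * math.sin(2.0 * phi))


# Hypothetical plate; every value here is an assumption for illustration.
WAVELENGTH_CM = 5.0e-5
THICKNESS_CM = 0.5
INDEX = 1.5

# Two neighbouring orders close to grazing emergence.
phi_high = emergence_angle(22371, WAVELENGTH_CM, THICKNESS_CM, INDEX)
phi_low = emergence_angle(22372, WAVELENGTH_CM, THICKNESS_CM, INDEX)

measured_gap = phi_high - phi_low
predicted_gap = order_separation(
    0.5 * (phi_high + phi_low), WAVELENGTH_CM, THICKNESS_CM, INDEX
)
print(f"gap from Eq. 4.30: {math.degrees(measured_gap):.4f} deg")
print(f"gap from Eq. 4.31: {math.degrees(predicted_gap):.4f} deg")
```

The higher order emerges at the smaller angle, which is the meaning of the negative sign in Equation 4.31.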

For a given order p, the dispersion ∂φ/∂λ is obtained by differentiating
Equation 4.30 with respect to λ. Taking into account that the
refractive index n is a function of wavelength λ,

$$\sin 2\phi\,\frac{\partial\phi}{\partial\lambda} = 2n\frac{dn}{d\lambda} - \frac{2p^2\lambda}{4d^2}$$

Therefore

$$\frac{\partial\phi}{\partial\lambda} = \frac{4d^2 n\,(dn/d\lambda) - p^2\lambda}{2d^2\sin 2\phi}$$

Substituting for d from Equation 4.29 gives

$$\frac{\partial\phi}{\partial\lambda} = \frac{2\lambda n\,(\partial n/\partial\lambda) - 2(n^2 - \sin^2\phi)}{\lambda\sin 2\phi} \qquad (4.32)$$

The dispersion of the interferometer is therefore independent of the
length and thickness of the plate, but it does depend on the dispersion
dn/dλ of the material of the plate, the wavelength and the angle
of emergence φ.

A value of δφ obtained from Equation 4.31 and equated to the
same value of ∂φ from Equation 4.32 will give δλ, the maximum
difference of wavelength possible without overlap between successive
orders. Thus,

$$\delta\phi = -\frac{\lambda\sqrt{n^2 - \sin^2\phi}}{d\sin 2\phi} = \frac{2\lambda n\,(\partial n/\partial\lambda) - 2(n^2 - \sin^2\phi)}{\lambda\sin 2\phi}\,\delta\lambda$$

Therefore

$$\delta\lambda = \frac{\lambda^2\sqrt{n^2 - \sin^2\phi}}{2d\left[n^2 - \sin^2\phi - \lambda n\,(dn/d\lambda)\right]}$$

As √(n² − sin²φ) = pλ/2d, from Equation 4.29,

$$\delta\lambda = \frac{p\lambda^3/2d}{2d\left[(p^2\lambda^2/4d^2) - \lambda n\,(dn/d\lambda)\right]}$$

or

$$\delta\lambda = \frac{p\lambda^2}{p^2\lambda - 4d^2 n\,(dn/d\lambda)} \qquad (4.33)$$

giving δλ, the difference of wavelength which a line in any order
must have to coincide with the main fringe in the next order.

Equation 4.33 also applies to a Fabry-Perot etalon, for which
dn/dλ = 0 (presuming the space between the etalon reflecting surfaces
is evacuated to a low pressure).
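Equation 4.33 is easy to evaluate numerically. The sketch below is an illustration only: the function name and all the numerical values (orders, thicknesses, wavelengths and the dispersion dn/dλ of the plate material) are assumptions and are not taken from the text. With dn/dλ = 0 the expression reduces to δλ = λ/p, the Fabry-Perot case.

```python
def wavelength_range_without_overlap(order, wavelength, thickness,
                                     n=1.0, dn_dlambda=0.0):
    """Equation 4.33: delta-lambda = p*lambda^2 / (p^2*lambda - 4*d^2*n*dn/dlambda).

    Lengths are in cm and dn_dlambda is in cm^-1.
    """
    denominator = order ** 2 * wavelength - 4.0 * thickness ** 2 * n * dn_dlambda
    return order * wavelength ** 2 / denominator


# Evacuated Fabry-Perot etalon (hypothetical): dn/dlambda = 0, so delta = lambda/p.
etalon_range = wavelength_range_without_overlap(order=10000, wavelength=6.0e-5,
                                                thickness=0.3)

# Lummer-Gehrcke plate (hypothetical): normal dispersion, so dn/dlambda < 0.
plate_no_dispersion = wavelength_range_without_overlap(
    order=22360, wavelength=5.0e-5, thickness=0.5, n=1.5)
plate_with_dispersion = wavelength_range_without_overlap(
    order=22360, wavelength=5.0e-5, thickness=0.5, n=1.5, dn_dlambda=-600.0)

print(f"etalon:                {etalon_range:.3e} cm")
print(f"plate, no dispersion:  {plate_no_dispersion:.3e} cm")
print(f"plate, with dispersion: {plate_with_dispersion:.3e} cm")
```

With normal dispersion the second term in the denominator adds to the first, so the usable wavelength range of the plate is narrower than the simple value λ/p.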

The resolving power of the Lummer-Gehrcke plate interferometer
may be estimated approximately by noting from Fig. 4.10
that the width of the beam entering the telescope or camera objective
lens from the top (or bottom) surface of the plate is l cos φ,
where l is the total length of the plate. The angle dφ between the
central maximum and the first minimum of a beam of this width is
λ/(l cos φ), by comparison with diffraction at a slit. In accordance
with the Rayleigh criterion this is the smallest angle resolvable.

From Equation 4.32,

$$d\phi = \frac{\lambda n\,(\partial n/\partial\lambda) - n^2 + \sin^2\phi}{\lambda\sin\phi\cos\phi}\,d\lambda$$

Put dφ = λ/(l cos φ),

$$\frac{\lambda}{l\cos\phi} = \frac{\lambda n\,(\partial n/\partial\lambda) - n^2 + \sin^2\phi}{\lambda\sin\phi\cos\phi}\,d\lambda$$

The resolving power λ/dλ is hence given by

$$\frac{\lambda}{d\lambda} = \frac{l}{\lambda\sin\phi}\left(\lambda n\frac{\partial n}{\partial\lambda} - n^2 + \sin^2\phi\right) \qquad (4.34)$$

Though useful, Equation 4.34 for the resolving power of a Lum- 
mer-Gehrcke interferometer is not strictly correct for two reasons : 

1. The amplitude is only uniform over the wavefront entering the 
objective lens for grazing emergence of the light, which is only 
approachable in practice. 

2. To calculate the resolving power exactly demands, as in the 
Fabry-Perot interferometer, an evaluation of the intensities 
in the parallel emergent beams. This computation is not easy 
because the series to be summed in adding up the successively 
decreasing intensities for the rays in the beam is not an infinite 
one as it is for the Fabry-Perot interferometer. 

A useful approximation to the resolving power is available, however,
from Equation 4.34 by putting φ = 90° (so sin φ = 1) and
omitting the dispersion term ∂n/∂λ, which makes a correction of
only about 7%. The resolving power is therefore approximately
evaluated (in magnitude) from the equation

$$\text{resolving power} = \frac{\lambda}{d\lambda} = \frac{l}{\lambda}\,(n^2 - 1) \qquad (4.35)$$

The resolving power can therefore be taken to be between one and
n² − 1 times the length of the plate measured in wavelengths.
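To give a feeling for the magnitudes involved, the following sketch evaluates Equation 4.35 and, for comparison, the magnitude of Equation 4.34. The plate length (10 cm), refractive index (1.5) and wavelength (5 × 10⁻⁵ cm) are assumed illustrative values, not data from the text.

```python
import math


def resolving_power_approx(length, wavelength, n):
    """Equation 4.35: (l / lambda) * (n^2 - 1), with phi = 90 deg and no dispersion."""
    return (length / wavelength) * (n * n - 1.0)


def resolving_power_full(length, wavelength, n, phi, dn_dlambda=0.0):
    """Magnitude of Equation 4.34; lengths in cm, phi in radians, dn_dlambda in cm^-1."""
    bracket = wavelength * n * dn_dlambda - n * n + math.sin(phi) ** 2
    return abs(length / (wavelength * math.sin(phi)) * bracket)


# Hypothetical plate parameters (assumptions for illustration only).
LENGTH_CM = 10.0
WAVELENGTH_CM = 5.0e-5
INDEX = 1.5

approx = resolving_power_approx(LENGTH_CM, WAVELENGTH_CM, INDEX)
grazing = resolving_power_full(LENGTH_CM, WAVELENGTH_CM, INDEX, math.radians(90.0))
print(f"Eq. 4.35 estimate:        {approx:.3e}")
print(f"Eq. 4.34 at phi = 90 deg: {grazing:.3e}")
```

The absolute value is taken in the second function because the bracket in Equation 4.34 is negative for any n greater than 1.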

The Lummer-Gehrcke plate is difficult to make and use because
the surfaces need to be optically flat to within λ/50 to λ/100; the
material must have an extremely uniform refractive index; mechanical
strain in mounting must be avoided; and the temperature
must be maintained constant to about ±0.01 deg C.

Quartz plates are preferred because of their excellent thermal 
stability. To obtain the maximum possible reflection at the surfaces, 
the incident light is best linearly polarised with the electric vector 
perpendicular to the plane of incidence (Chapter 3), the extraordi- 
nary ray being made use of in a quartz plate. 




Exercise 4 

1. Give an account of the theory of either the Fabry-Perot interferometer or the 
Michelson interferometer, and describe in detail one application in metrology 
or spectroscopy of the instrument you have discussed. (M.P.) 

2. Describe the optical arrangement used with a Fabry-Perot etalon and the 
fringe system observed with monochromatic light. How are the fringes pro- 
duced and what factors determine the chromatic resolving power of the 
etalon? How may the difference in wavelength between a single spectral line
of known wavelength λ and a close satellite of wavelength λ + dλ be
determined from measurements on their respective fringes of the same order n?

(L.G.) 

3. How would you determine the separation of two plane parallel plates about 
1 cm apart to an accuracy of 1 in 10 5 , given sources of standard wavelengths ? 
Having found this separation, how would you determine precisely the sepa- 
ration of another pair of plates whose distance apart was almost twice that 
of the former pair? (L.P.) 

4. Explain the formation of fringes by a Fabry-Perot interferometer, and de- 
scribe, with the necessary theory, how the instrument can be used in the quan- 
titative examination of the hyperfine structure of spectral lines in the visible 
region. Give a brief account of the experimental arrangements necessary for 
such an investigation, and explain how the resolving power is affected by the 
separation and reflecting power of the interferometer plates. (L.P.) 

5. Describe the Fabry-Perot interferometer, discussing the features which give 
it a high spectroscopic resolving power. 

It is desired to resolve two radiations emitted by a gas laser whose wave
numbers differ by 0.005 cm⁻¹. What interferometer separation is required
if the reflectivity of the plates is such that it is possible to resolve two
fringes separated by a tenth of the separation between the orders? (L.P.)

6. Assuming that a narrow metal rod about 50 cm long, with highly polished 
plane ends is provided, suggest some interferometric method whereby the 
length may be measured with high precision in terms of wavelengths of 
light. 

Give an account of the technique to be adopted and discuss the character of 
the light source to be used. (L.P.) 

7. Spectrograms of the sun taken from opposite ends of the diameter
perpendicular to its axis of rotation show that the corresponding Fraunhofer sodium
lines have a relative displacement of 0.078 Å. Assuming that the diameter of
the sun is 1.39 × 10¹¹ cm, that the mean wavelength of the light is 5893 Å
and that the velocity of light is 3 × 10¹⁰ cm sec⁻¹, obtain a value in days for
the period of rotation of the sun about its axis. Derive any formula you use 
and suggest, with reasons, the type of spectrograph it would be necessary to 
use to obtain wavelength measurements of the required precision. (L.P.) 

8. Give reasons for the choice of the wavelength of the orange line of krypton 
86 as a standard of length. Is it likely that a standard of greater precision will 
become available in the future? 

A spectrum line having two components is observed using a Fabry-Perot
interferometer with a 1 cm plate separation, and it is found that the angular
separation of the two components is one-tenth that between orders. If the
mean wavelength is 5.0 × 10⁻⁵ cm, calculate the wavelength difference
between the components. (L.P.)

9. Write an account of interference filters designed to pass a fairly narrow band 
of wavelengths in the visible region of the spectrum. 
Given that the refractive index of magnesium fluoride deposited in vacuo and 
subsequently baked in air is 1.37, calculate the thickness of a film of this
material required to form an interference filter of the Fabry-Perot type which
will selectively transmit light in a narrow region about the wavelength of
5,550 Å at which the human eye has its maximum sensitivity in daylight.

10. Describe the Lummer-Gehrcke interferometer plate and deduce an equation 
which gives approximately its chromatic resolving power. 

Why has the Lummer-Gehrcke plate become almost obsolete in favour of 
the Fabry-Perot instrument? 

11. Give the theory of the formation of fringes by the transmission of a plane 
monochromatic beam of light through two adjacent, parallel, optically-flat 
and partially reflecting surfaces. 

It is desired to form an interference filter which will pass only substantially 
monochromatic radiation of wavelength 6000 Å. Describe a method of doing
this and state, without mathematical detail, how the band-pass may be
controlled. (Poly.)



CHAPTER 5 



Spectroscopy 



5.1 Emission Spectra 

Bunsen and Kirchhoff did much of the original work in spectros- 
copy. In 1860, they gave the explanation of the Fraunhofer lines in 
the solar spectrum. They also invented a prism spectrometer incor- 
porating an illuminated scale. Their most important discovery was 
that, under specified conditions, each element emitted light produc- 
ing a definite spectrum characteristic of the element. 

A main purpose of optical spectroscopy is the rapid analysis of 
materials by examination of either the emission or the absorption 
spectra. This book is concerned with spectroscopy utilising
electromagnetic radiations in the visible, ultra-violet and infra-red
regions. The important and rapidly developing sciences and techniques
of X-ray spectroscopy, γ-ray spectroscopy and spectroscopy in the
microwave radio region are not to be described. The ranges of 
wavelengths, frequency and photon energies of the radiations to be 
considered are shown in Table 5.1. 

In the solid state, atoms and molecules are closely packed to- 
gether. For example, tungsten wire is polycrystalline material in 
which atoms are locked in the lattice structure at separations of 
about the same size (1-2 Å) as those of the atoms themselves.
Consequently, the energy levels occupied by the electrons in the 
atoms in a solid are affected by the proximity of neighbouring 
atoms. Isolated atoms (considered as atoms sufficiently far apart 
for their mutual influences on electronic structure to be negligible) 
will, when excited, emit a spectrum characteristic of the atom in the 
form of a series of lines of discrete frequencies. Atoms (or molecules) 
in solids, on the other hand, will have their energy levels so consider- 
ably modified by proximity that a continuous band of energy levels 
is concerned and not a series of specific separated ones. 

Table 5.1. ULTRA-VIOLET, VISIBLE AND INFRA-RED RADIATION

Radiation       Wavelength range       Frequency range           Range of photon
                (cm)                   (Hz)                      energies* (eV)

Ultra-violet    10⁻⁶ to 4×10⁻⁵         3×10¹⁶ to 7.5×10¹⁴        124 to 3.1
Visible light   4×10⁻⁵ to 7×10⁻⁵       7.5×10¹⁴ to 4.3×10¹⁴      3.1 to 1.78
Infra-red       7×10⁻⁵ to 10⁻²         4.3×10¹⁴ to 3×10¹²        1.78 to 0.0124

* Calculated from the equation energy = h × frequency, where h = 6.625 × 10⁻³⁴ J s
is Planck's constant. The energy thus given in joules is converted into electron-volts on
the basis that 1 eV = 1.6 × 10⁻¹⁹ J.
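The entries in Table 5.1 can be reproduced with a few lines of arithmetic. The sketch below uses the values of h and of the electron-volt quoted in the footnote; the rounded velocity of light, 3 × 10¹⁰ cm sec⁻¹, is an assumption made here, and the function names are illustrative.

```python
PLANCK_J_S = 6.625e-34       # Planck's constant, as quoted in the footnote
JOULES_PER_EV = 1.6e-19      # conversion factor, as quoted in the footnote
LIGHT_SPEED_CM_S = 3.0e10    # assumed rounded velocity of light


def frequency_hz(wavelength_cm):
    """Frequency corresponding to a wavelength given in centimetres."""
    return LIGHT_SPEED_CM_S / wavelength_cm


def photon_energy_ev(wavelength_cm):
    """Photon energy h * frequency, converted from joules to electron-volts."""
    return PLANCK_J_S * frequency_hz(wavelength_cm) / JOULES_PER_EV


# Boundaries of the three regions listed in Table 5.1.
for wavelength_cm in (1.0e-6, 4.0e-5, 7.0e-5, 1.0e-2):
    print(f"{wavelength_cm:.1e} cm  "
          f"{frequency_hz(wavelength_cm):.2e} Hz  "
          f"{photon_energy_ev(wavelength_cm):.4g} eV")
```

Small differences from the tabulated figures in the last digit arise from rounding of the frequencies in the table.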

In a gas or vapour, the mean free paths between atoms (or mol- 
ecules) are great enough except at high pressures for the atoms to be 
isolated from one another. For example, in nitrogen gas at s.t.p., 
the mean free path is approximately 6.6 × 10⁻⁶ cm, some 200 times the
molecular diameter. 

It is apparent, therefore, that the characteristic wavelengths in 
emission spectra are only obtained if the substance is in the vapour 
or gaseous state. Line spectra will be obtained for atoms; band
spectra for molecules.

The solid, when heated, will emit a continuous spectrum because 
of its continuous energy-band structure. This spectrum will not be 
characteristic of the material of the solid, but will depend on the 
temperature and emissivity. Thus the continuous spectrum in the 
light from a heated tungsten filament lamp is similar to that from 
a lamp with, say, a molybdenum filament. 

Referring to Table 5.1, it is seen that energy must be made avail- 
able to the vaporised atoms of a substance to excite them to states 
so that photons are emitted. The possible sources of radiation for 
this purpose are the flame, the discharge tube, the arc and the spark. 



5.2 Flame Sources 

A Bunsen burner flame is at a temperature T of about 1,200°K.
The most probable energy of the particles in the flame is kT, where
k is Boltzmann's constant = (1/11,600) eV per deg K. Therefore,
the most probable energy in the Maxwell-Boltzmann distribution is
about 0.1 eV, which is far too small to provide the necessary energies
for photons in the visible region. Nevertheless, the alkali metals
emit visible spectra when in a flame source, as is well-known for
sodium chloride introduced into a Bunsen flame. Sufficient energy
is available as a consequence of exothermic reactions that the substance
undergoes with gases in the flame and the atmosphere.
The alkali metals, in particular sodium, potassium and lithium, 
will emit some, but not all, of the lines in their characteristic spectra 
when heated in a Bunsen flame because these metals are strongly 
chemically active. Thus, sodium chloride will dissociate in the reac- 
tions taking place in a Bunsen flame and emit the characteristic 
spectrum of sodium. 
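The thermal-energy estimate for the flame is a one-line calculation, sketched below with the value of Boltzmann's constant quoted above. The lower photon energy of the visible region, 1.78 eV, is taken from Table 5.1; the names used are illustrative.

```python
BOLTZMANN_EV_PER_K = 1.0 / 11600.0   # Boltzmann's constant, as quoted in the text
VISIBLE_MIN_EV = 1.78                # lowest visible photon energy, from Table 5.1


def thermal_energy_ev(temperature_k):
    """Characteristic thermal energy kT in electron-volts."""
    return BOLTZMANN_EV_PER_K * temperature_k


flame_kt = thermal_energy_ev(1200.0)
print(f"kT in the flame: {flame_kt:.3f} eV")
print(f"lowest visible photon energy is {VISIBLE_MIN_EV / flame_kt:.0f} times kT")
```

The large ratio is the reason a purely thermal explanation of the visible emission fails and the energy released in chemical reactions has to be invoked.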

For the majority of materials, however, a more effective source 
than a flame is necessary, and it is clear from Table 5.1 that this 
source of excitation needs to be more energetic the shorter the 
wavelength of the radiation emitted. 



5.3 Discharge Tubes 

The discharge tube contains gas at pressures ranging from about 
0.1-10 mm Hg. Electrodes are sealed into opposite ends of the
tube. A p.d. of several hundred volts maintained across these elec- 
trodes with a ballast resistance in series causes an electrical dis- 
charge to pass; the gas atoms and molecules are excited and ionised, 
and the glowing positive column in the discharge fills most of the 
tube. As compared with the arc source (Section 5.4), the glow dis- 
charge source operates with higher potential gradients in the gas and 
with smaller current densities. 

An early example, still widely used in college laboratories, is the 
Geissler tube [Fig. 5.1(a)]. This consists of a capillary tube about 10 
cm long and about 1 mm bore, between wider end tubes in which 
the electrodes are sealed. The capillary tube is aligned with the 
spectrometer slit with or without an intermediate focusing lens. 
The p.d. across the electrodes can be d.c. from an induction coil or 
a power pack or a.c. from a transformer. A ballast resistance is 
needed to limit the current in the discharge once the glow discharge 
is excited. 

Such tubes are particularly suitable for the excitation of gases. 
The inert gases, particularly helium (He), neon (Ne) and argon (A), 
are monatomic and will emit their characteristic atomic line spectra. 
Other gases, such as nitrogen (N₂) and carbon dioxide (CO₂), will
emit band spectra, but less stable molecules (for example, of hydrogen,
H₂) are dissociated so that atomic line spectra are produced.

Elements which do not occur in the vapour state except at high
temperatures can be excited in a Geissler tube by the use of an
auxiliary gas, usually the inert gas argon. For example, the spectrum
of iodine is emitted if this element is present within a Geissler
tube containing argon. The spectra will, in general, result from both
the atoms present and their positive ions, predominantly those which
are singly charged.

Fig. 5.1. Discharge tube sources of spectra: (a) Geissler tube; (b) commercial
spectral lamp (labels: metal-to-glass seal; aluminium or stainless steel
cylindrical electrode; discharge glow in capillary tube; electrodes; arc
discharge in rare gas and metallic vapour; glass cylindrical bulb; glass pinch)

If the tube is made of glass, ultra-violet radiation will be severely 
reduced by absorption. Quartz tubes will permit transmission of the 
ultra-violet down to about 2,000 Å.

Typical of more sophisticated discharge sources are commercial 
spectral lamps. These consist of a small discharge tube surrounded 
by an outer cylindrical bulb [Fig. 5.1(b)]. Within the discharge tube 
are electrodes sealed in at the two ends, and either a gas, a metallic 
vapour or a mixture of the two (for example, argon and sodium). 
This type of lamp is operated at considerably higher current density 
and higher pressure than the Geissler tube and has a much larger 
light output. Indeed, the discharge is a self-sustaining arc, rather
than a glow discharge in which fairly well-defined regions such as the
positive column appear. For example, the range of spectral lamps
obtainable from Philips Ltd have arc lengths of between 25 mm
and 40 mm, depending on the type, and operate at a current of
0.9 A. A typical power supply is an auto-leak transformer of which
the primary is connected across the a.c. mains at 240 V r.m.s.; 
the secondary provides an open-circuit e.m.f. of 470 V r.m.s., 
which provides a suitable striking p.d. across the lamp electrodes. 
When the arc strikes, this p.d. falls to about 10-50 V depending on 
the nature of the element within the tube and the operating pressure. 

The discharge tube and outer cylindrical bulb are of glass for 
sources in the visible region of the spectrum, but of quartz for
ultra-violet down to 2,000 Å approx.

For college optical experiments, the most useful of these lamps 
are those giving the spectra of cadmium, sodium, mercury, thallium 
(which gives a line at 5,350 Å of width only 0.05 Å) and gallium
(giving intense sharp lines at 4,033 Å and 4,172 Å). Lamps of this
type with inert gas fillings (helium, neon, argon and krypton) are 
also useful. Lamps with high-pressure fillings, particularly of 
argon, xenon or krypton at 3-8 atm, give intense light outputs 
through the visible spectrum in a continuum; on the other hand, 
the lower-pressure lamps provide line spectra. The high-pressure 
xenon lamp gives a continuous spectrum which is a good approxi- 
mation to the solar spectrum. 

Interference filters (Section 4.7) of the multi-layer dielectric 
types are valuable adjuncts to these lamps. Placed within a beam 
of collimated light from the lamp, such filters are available with 
a narrow band-pass in appropriate regions to cut off all but the 
selected lines required. 

A hydrogen discharge lamp of special design is a useful source 
of continuous radiation in the ultra-violet region. The discharge 
takes place between a heated cathode and an enclosed anode. The 
covering of the anode has a small opening of 1 mm diameter 
through which the discharge is compressed to give a spot of high 
radiation density. The striking p.d. is about 320 V and the operating 
p.d. is 100 V. Automatic control of the anode current holds the 
radiation output constant to ±0.2% for mains voltage changes
between 180 V and 240 V. With a quartz envelope, the radiation is
free of lines and is continuous between 2,000 Å and 3,500 Å.

A recent innovation in place of the hydrogen lamp is a discharge 
lamp containing deuterium (the heavy isotope of hydrogen of mass 
number 2). This gives more than twice the intensity of a hydrogen 
lamp of corresponding size and power consumption. A version 
(Ealing Scientific Ltd), with a special optical window made of 
Suprasil, enables the continuous ultra-violet spectrum down to 
1,650 Å to be transmitted.

The Paschen hollow-cathode source is a discharge tube in which 
a cold cathode containing a cavity is used. The discharge is excited 
in an inert gas at low pressure (10⁻² to 10⁻¹ mm Hg) by a high p.d.
established between this cathode and an anode. The cathode metal
is sputtered by the impinging positive ions of the rare gas. The 
hollow-cathode design reduces the perturbing effects of electric 
fields and gas pressure on the breadth of the spectral lines. Sharp 
lines of the spectrum of the excited atoms of the metal sputtered 
from the cathode are therefore produced. These sharp lines are 
mostly due to the neutral atoms and their singly positively charged 
ions. 

Electrodeless discharges are produced by surrounding a glass or 
quartz tube containing gas at a pressure of 10⁻² to 10⁻¹ mm Hg
by a coil carrying radiofrequency current. The tube does not 
require internal electrodes, and the electric field strengths set up 
by the radiofrequency electromagnetic radiation are moderate. The 
spectral lines produced are consequently primarily due to neutral 
atoms and are sharp relative to those from Geissler tube sources. 
Such electrodeless discharge sources can also be used to emit the 
line spectra of non-volatile elements : a vaporisable compound of 
the element is incorporated in the tube, which contains an inert 
gas. This compound dissociates in the discharge, and the spectra 
of the elements resulting are obtained. 

To produce spectral lines which are extremely narrow, atomic 
beam sources are used. Such lines of very small breadth in wavelength 
are needed in studies of hyperfine structure with, for example, the 
Fabry-Perot interferometer. To ensure extreme sharpness, the ther- 
mal motion of the radiating atoms has to be minimised to reduce 
Doppler broadening (Section 5.8). A beam of atoms of the element 
whose spectrum is required is produced by evaporating the element 
in a small oven maintained at high temperature in a vacuum. The 
ejected atoms are collimated into a beam by passage through successive
aligned apertures. The atoms in this beam are excited by
electrons which are emitted from a thermionic filament and accelerated
to an anode on the opposite side of the beam. The spectrum emitted
is observed in a direction perpendicular to that of the atomic beam. 
As the random motion of the atoms along this direction of obser- 
vation is very small, Doppler broadening is small. The best practice 
is to excite the atomic beam by electrons directed at right angles 
to it and make observations from a direction perpendicular to both 
the atomic beam and the electron beam. The intensity of light in 
the spectrum is unfortunately unavoidably limited. 



5.4 Arc Sources 

Electric arcs in the atmosphere set up between electrodes of carbon 
or of certain metals are widely used in spectroscopic analysis as 
sources of characteristic radiation. 






The d.c. carbon arc (Fig. 5.2) will now be described to illustrate 
the operation of arc sources. Two carbon rods are supported verti- 
cally in line within a support which enables the end of one rod to be 
moved against the other and then separated again. A source of 
e.m.f. (from a battery, the d.c. mains or a power pack of 50-240 V) 
is set up across the carbon rods with a ballast resistor in series — nec- 
essary because the arc has a negative resistance characteristic. 



Fig. 5.2. The d.c. carbon arc (labels: milled head turned to raise and
lower top electrode by means of a rack and pinion; terminals; ballast
resistor; d.c. supply (50-240 V); milled head to raise or lower pillar
by means of rack and pinion)

The ends of the rods are pushed together and then drawn apart 
gradually. A bright arc at a temperature of 3,000°-4,000°C is 
thereby produced across the ends, which are ordinarily at between 
1 mm and 10 mm apart. As the arc current passes, the end of the 
positive carbon rod (usually the lower one) becomes hollowed 
out to a crater, whereas the end of the negative rod becomes pointed. 
The end of the positive rod is the hottest part and it emits a con- 
tinuous spectrum. The vapour between the rods emits the many 
lines in the spectrum of carbon. Instead of the rods being moved,
the arc may be ignited by momentary application of a high p.d.
across the electrodes to cause an initiating spark. 

To study, with the carbon arc, the line spectra of other elements, 
a small amount of the material in the form of the element or a
suitable salt is placed in the crater of the lower positive rod or, 
better, the centre of this rod is bored with an axial hole which is 
fed with the material. The material dissociates (if a compound), 
volatilises and the excitation of its vapour in the arc produces the 
required emission spectrum. 

For many metals, such as iron, copper and aluminium, the arc 
may be formed between rods of these metals themselves in place 
of the carbon electrodes. The positive rod is likely to melt if the 
current is too large. The p.d. across the arc is between 30 V and 
60 V, with a current of 1.5-20 A.
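The size of the series ballast resistor follows from Ohm's law once a supply voltage, an arc p.d. and an arc current have been chosen. The sketch below is a rough steady-state estimate only: the particular operating point (a 240 V d.c. supply, 50 V across the arc, 5 A) is an assumed example within the ranges quoted for the arc, and the function names are illustrative.

```python
def ballast_resistance(supply_volts, arc_volts, arc_current_amps):
    """Series resistance (ohms) that drops the excess supply voltage at the arc current."""
    if arc_current_amps <= 0:
        raise ValueError("arc current must be positive")
    if supply_volts <= arc_volts:
        raise ValueError("supply voltage must exceed the p.d. across the arc")
    return (supply_volts - arc_volts) / arc_current_amps


def ballast_power(supply_volts, arc_volts, arc_current_amps):
    """Power (watts) dissipated in the ballast resistor at the operating point."""
    return (supply_volts - arc_volts) * arc_current_amps


# Assumed operating point, chosen for illustration only.
resistance = ballast_resistance(240.0, 50.0, 5.0)
power = ballast_power(240.0, 50.0, 5.0)
print(f"ballast resistor: {resistance:.0f} ohm, dissipating {power:.0f} W")
```

Most of the supply voltage is dropped across the resistor, so a large part of the input power is dissipated there rather than in the arc.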

Owing to the high temperature of the arc compared with that of 
the flame, most of the energy levels of neutral atoms are excited, so 
giving a spectrum containing many more lines than in the flame 
spectrum. If the vapour in the arc is of a particularly stable com- 
pound which does not dissociate at temperatures of 4,000°C, band 
spectra are produced. 

The higher the p.d. across the arc, the less likely is flicker. The 
temperatures produced are sufficient to vaporise all materials. 
Careful cleaning of the electrodes is necessary to avoid unwanted 
spectral lines due to contamination. High-voltage a.c. arcs are less 
prone to flicker and movement of the point of contact of the arc 
over the negative electrode surface, so such arcs are preferable 
for quantitative analysis in spectrochemical work. Satisfactory 
operation at above 2,000 V r.m.s. from a transformer requires, 
however, a fairly complex supply source. 

Arc sources are also operated at sub-atmospheric pressures and 
at pressures of several atmospheres, but the latter become high- 
pressure arc lamps giving a continuous or mixed spectrum rather 
than a line spectrum. 

The arc method cannot detect the carbon or hydrogen in organic 
compounds, nor the oxygen content of rocks. Materials which melt
and then do not vaporise can be mixed with powdered silica to 
prevent fusion. Powdered silica may also be soaked in a liquid to 
be analysed. Some components present in small quantities may 
disappear quickly and so must be recorded by photographic or 
photoelectric methods. 



5.5 Electric Spark Sources 

Electric spark sources are similar to arcs, except that the p.d.
is intermittent and is much greater, in the range 10-50 kV, and
the current density is less. The spark is produced across the gap
between electrodes by connecting them across the secondary of a
transformer with an inductor in series (Fig. 5.3). When the p.d.
across the gap rises to the value at which breakdown occurs, an
oscillatory surge of ions and electrons passes between the electrodes,
forming the spark. The oscillation frequency, and hence the
repetition rate of the spark, depends on the inductance and capacitance
in the circuit. At atmospheric pressure, the gap is 1-2 mm
and the p.d. about 10 kV. Sources in which the spark is enclosed
in a vessel at low pressure are also used, in which case the p.d.
may be as much as 50 kV.

Fig. 5.3. The spark source (labels: resistor R, which limits the power,
usually to about 1 kW; 2 mm gap; air-cored inductor)
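The dependence of the oscillation frequency on inductance and capacitance can be illustrated with the standard expression for an ideal circuit, f = 1/(2π√(LC)). This is a sketch under the assumption that resistance and the spark gap itself have a negligible effect on the frequency; the component values (100 μH and 0.005 μF) are hypothetical and are not taken from the text.

```python
import math


def oscillation_frequency_hz(inductance_h, capacitance_f):
    """Natural frequency of an ideal (loss-free) LC circuit, 1 / (2*pi*sqrt(L*C))."""
    if inductance_h <= 0 or capacitance_f <= 0:
        raise ValueError("inductance and capacitance must be positive")
    return 1.0 / (2.0 * math.pi * math.sqrt(inductance_h * capacitance_f))


# Hypothetical component values, for illustration only.
INDUCTANCE_H = 100e-6      # assumed air-cored inductor, 100 microhenries
CAPACITANCE_F = 0.005e-6   # assumed circuit capacitance, 0.005 microfarads

frequency = oscillation_frequency_hz(INDUCTANCE_H, CAPACITANCE_F)
print(f"oscillation frequency: {frequency / 1e3:.0f} kHz")
```

Because the frequency varies as the inverse square root of LC, quadrupling either component halves the frequency.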

Temperatures of some 10,000°K are effectively produced in the 
spark, and the electrons produced are sufficiently accelerated in 
the intense electric field to ensure that atoms present are excited 
and ionised. The spectra emitted are therefore for high states of 
excitation and where a large fraction of the atoms become ionised. 
The ions will be singly charged or multi-charged (the atom losing 
more than one electron), but singly charged ions predominate 
usually. 

The possible methods of obtaining the spectra of various materials 
are: to make the electrodes themselves of the appropriate metal or 
carbon ; to allow liquids to drip from a funnel forming one electrode 
to a container forming the other electrode. 

The extent of the excitation and ionisation of the material in the 
spark depends upon the duration of the spark and the phase in 
the applied p.d. at which it occurs. Some control over this extent 
is therefore possible by selection of the inductance and capacitance 
in the oscillatory circuit associated with the transformer secondary. 
Better control is achieved by electronic regulation of phase and 
spark duration. 

Spark sources are useful in the visible and ultra-violet regions. 
Vacuum spark sources are particularly suitable in studies of the
spectra of multiply ionised atoms. To eliminate lines due to
atmospheric nitrogen and oxygen, an alternative to the vacuum spark
is to run the spark in an atmosphere of hydrogen which has a 
simple spectrum. 

The arc or the spark source can be used to obtain the emission 
spectra of 65 metallic elements; however, none of the 10 non- 
metallic elements (arsenic, antimony, boron, carbon, iodine, 
silicon, selenium, sulphur and tellurium) has satisfactory lines in the 
visible region with an arc spectrum but they do in a spark or 
discharge tube spectrum. All but iodine and sulphur have arc lines 
in the ultra-violet spectrum; boron has no visible spectrum. 



5.6 Absorption Spectra 

For many materials, emission spectra cannot be produced because 
the materials cannot readily be introduced into flame, gas discharge 
tube, arc or spark sources. The main difficulties are with liquids, 
some gases, and compounds which dissociate. The technique of
absorption spectroscopy is therefore practised extensively. The
material in solid or liquid form is transparent to radiation in one
or more of the ultra-violet, visible or infra-red regions. A sample of
the material, usually in plate form or as a liquid in a transparent 
parallel-sided container, is placed between a radiation source and 
the slit of the spectrometer, so that the sample is in the continuous- 
spectrum beam from the source. The spectrum obtained is due to 
absorption of energy in lines, bands or a continuum, depending on 
the nature of the material: an absorption line spectrum is obtained 
for elements, a band spectrum for molecules, and a continuum for 
several dye-stuffs and other materials. Certain parts of the continuous
spectrum from the source will be absent or reduced in intensity in 
the recorded absorption spectrum. For example, the well-known 
doublet lines at 5,890 A and 5,896 A in the spectrum of sodium will 
appear as black lines against a continuum in an absorption spec- 
trum. 

The source of the continuous spectrum used in absorption spec- 
troscopy is a tungsten filament lamp or a high-pressure xenon or 
krypton arc for work in the visible region, and a hydrogen or 
deuterium discharge lamp in the ultra-violet region. 



5.7 Sources of Infra-red Radiation 

In general, absorption spectroscopy is the usual technique for 
analysis in the infra-red region, particularly of the band spectra of 
liquid and gaseous compounds. 




A source of continuous infra-red radiation is required. The ideal 
source is a black-body radiator; this is difficult to provide, but 
several other methods form satisfactory substitutes even though 
the emissivity of the material used is not unity. 

The Nernst glower is a rod of zirconium oxide with the addition 
of yttrium oxide and rare earths. It is a non-conductor at tempera- 
tures below 800°C. It is raised to this temperature in a gas flame 
or by a surrounding electric heater and is then heated by the passage 
through the rod of electric current. The temperature may be varied, 
but the current must be sufficient to prevent the temperature de- 
creasing below 800°C. The distribution of radiation with wavelength 
is Planckian with a peak energy at a wavelength of 22,700 A for a 
temperature of 1,000°C (see Section 6.6 of Volume 1) and an 
emissivity of 0.6. This glower is operated in air, so the energy against
wavelength distribution will show the effect of carbon dioxide and 
water vapour absorption. The globar source is a silicon carbide rod 
heated electrically. (See also Section 5.36.) 
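
The quoted peak wavelength can be checked with Wien's displacement law, λ_max T = constant. The sketch below assumes the commonly tabulated value of the Wien constant (2.898 × 10⁻³ m K), which is not given in the text; the function name is illustrative only.

```python
WIEN_CONSTANT_M_K = 2.898e-3  # Wien displacement constant in metre kelvin (assumed value)


def peak_wavelength_angstrom(temp_celsius: float) -> float:
    """Wavelength of peak emission of a Planckian radiator, in angstrom units."""
    temp_kelvin = temp_celsius + 273.15
    return WIEN_CONSTANT_M_K / temp_kelvin * 1e10  # metres -> angstroms


if __name__ == "__main__":
    # Nernst glower at 1,000 deg C, as in the text.
    print(f"{peak_wavelength_angstrom(1000.0):,.0f} angstroms")
```

The result lies within about 100 Å of the 22,700 Å quoted above. A constant emissivity of 0.6 lowers the emitted power but does not shift the position of the peak.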

5.8 Causes of Spectral Line Broadening 

Instruments of low chromatic resolving power, such as a small 
prism spectrometer, are unable to reveal the fine structure of a 
spectral line. With a very narrow slit, the doublet structure of the 
yellow sodium spectral lines which differ in wavelength by approxi- 
mately 6 A can just be revealed. Instruments of high resolution not 
only show fine structure but reveal that the intensity of each line 
is greatest in the middle and decreases towards the edges and also 
that lines have different breadths. 

The width of a spectral line is an indication of the temperature 
of the source — a fact of use in astronomy for the estimation of the 
temperatures of stars. This effect results from Doppler's principle 
and is known as Doppler broadening. Radiating atoms (and molec- 
ules) have velocities of translation, and these velocities will have 
components ranging from zero to well above the mean value in the 
line of sight. Atoms will continue to produce radiation of their
characteristic frequencies, but their velocity will affect the
wavelength measured, so that the radiation of a particular frequency
will not be confined to one infinitesimally narrow spectral line.
The resulting broadening of the line may be of such magnitude as to
obscure the fine structure. Rayleigh states that the brightness of
the line should be proportional to exp(−Kφ²), where φ is the distance
from the centre of the line in angular measure, and K is a constant
which is larger the narrower the line. The value of K increases with
decrease of the velocity of translation, which diminishes as the
temperature falls and as the mass of the radiating atom increases
(Fig. 5.4).



Fig. 5.4. Intensity distribution in a spectral line: (a) K large; (b) K small

At high pressure, interactions between radiating atoms in close
proximity to each other will modify the frequencies of their
radiation. The lower the pressure, the narrower the spectral line
emitted. This effect is known as pressure broadening.

An atomic oscillator loses energy by radiation. The amplitude 
of the wave decreases as it is emitted and thus is no longer a simple 
harmonic wave of one frequency. Fourier analysis reveals that the 
apparently infinite wave train can be resolved into numerous com- 
ponent waves differing only very slightly in frequency from the 
fundamental frequency but sufficiently to produce an apparent 
broadening of the line. This effect is known as radiation broadening. 

For purposes such as the study of fine and hyperfine structure, 
spectroscopically pure materials are necessary. Isotopically pure 
elements must therefore be used, because variations in the mass 
and the spin of the nucleus affect the frequency of the radiation 
(Section 2.5). The presence of a range of isotopes can so broaden 
a spectral line that other causes of line broadening are masked. 

The ideal source is an isotopically pure material electrically 
excited at low pressure and at low temperature. The kinetic velocities 
of atoms are proportional to the square root of the absolute temper- 
ature. Cooling a source from 293°K (20°C) to the temperature of 
liquid nitrogen 77°K (— 196°C), however, will only approximately 
halve the breadth of a spectral line caused by the kinetic motion of 
the radiating atoms. 
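
The statement that cooling to liquid-nitrogen temperature only about halves the Doppler breadth can be checked numerically. The sketch below uses the standard full width at half maximum for a Maxwellian velocity distribution, Δλ = λ√(8kT ln 2 / mc²), which is not derived in this text; the choice of sodium at 5,890 Å is purely illustrative.

```python
import math

BOLTZMANN_J_PER_K = 1.380649e-23
SPEED_OF_LIGHT_M_PER_S = 2.99792458e8
ATOMIC_MASS_UNIT_KG = 1.66053907e-27


def doppler_fwhm_angstrom(wavelength_angstrom: float,
                          temp_kelvin: float,
                          mass_amu: float) -> float:
    """Doppler full width at half maximum of a line, in angstrom units."""
    mass_kg = mass_amu * ATOMIC_MASS_UNIT_KG
    fractional_width = math.sqrt(
        8.0 * BOLTZMANN_J_PER_K * temp_kelvin * math.log(2.0)
        / (mass_kg * SPEED_OF_LIGHT_M_PER_S ** 2)
    )
    return wavelength_angstrom * fractional_width


if __name__ == "__main__":
    # Illustrative case: sodium (mass about 23 u) radiating at 5,890 angstroms.
    warm = doppler_fwhm_angstrom(5890.0, 293.0, 23.0)
    cold = doppler_fwhm_angstrom(5890.0, 77.0, 23.0)
    print(f"width at 293 K: {warm:.4f} A, at 77 K: {cold:.4f} A")
    print(f"ratio: {cold / warm:.3f}")
```

The ratio of the two widths is √(77/293), a little over one half, whatever the wavelength or the mass of the atom.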



5.9 Detection of Radiation 

The detectors and principles of detection of radiation which are 
used in spectroscopy are listed. 

The human eye: sensitivity extends from 4,000 Å to 7,000 Å
approx., with peak sensitivity at 5,550 Å in bright light
(Section 3.2 of Volume 1).

Photography: the common photographic emulsion is sensitive 
in the visible spectrum and in the ultra-violet; special emul- 
sions for use in the ultra-violet and the infra-red are made. 

Photoelectric devices: these include vacuum photocells, gas- 
filled photocells, photomultipliers, barrier layer cells, the 
photodiode and the phototransistor. 

Photoluminescence: many substances emit radiation when photons
are incident on them, this radiation being of longer wavelength
than that of the incident radiation. The luminescent
emission persists after the excitation due to the photons in 
the incident radiation is removed. This emission decays ex- 
ponentially in a manner depending on the life time of the state 
excited in the luminescent material. The exponentially decay- 
ing luminescence is known as fluorescence. Some luminescent 
materials also exhibit an afterglow which decays more slowly 
than the fluorescence and in a more complex manner; this 
phenomenon is called phosphorescence. 

Thermal detectors: these include the bolometer, in which the 
increase of resistance of a metallic wire or strip with tem- 
perature rise brought about by incident radiation forms the 
basis of the sensing device; thermistors, which are mixtures 
of oxides (including those of iron, nickel, copper, zinc and 
manganese) which have large negative temperature coefficients 
of resistance and are therefore alternative sensors to the 
bolometer; the thermojunction and the thermopile which make 
use of the thermoelectric effect; and the Golay cell, in which 
incident thermal radiation increases the pressure of gas in an 
enclosure and where this pressure rise is indicated by the 
movement of a diaphragm. (See also Section 5.36.) 



5.10 Photographic Emulsions 

Both films and plates are used, but plates are usually preferred 
because of small changes in the dimensions of film due to shrinkage 
in processing. 

Panchromatic emulsions are sensitive throughout the visible 
region and in the ultra-violet, but the spectral response is consider- 
ably different from that of the human eye (Fig. 5.5). Such emulsions 
can be used down to wavelengths of about 2,200 A in the ultra- 
violet; the spectrograph has to be furnished with quartz optics. 
Below 2,200 Å, gelatine (a prime constituent of the usual
photographic emulsions) absorbs ultra-violet radiation strongly and
so normal photographic plates are useless. This problem can be
overcome by smearing the photographic plates with paraffin oil,
which is fluorescent and so converts the short wavelengths to 
longer ones which are not absorbed by the gelatine. After exposure, 
the paraffin is removed with acetone and the plates are processed 
normally. This method allowed Lyman to undertake spectrog- 
raphy in the ultra-violet down to wavelengths of 580 A. 

The more modern method is to use Schumann photographic 
plates, which are available commercially. These contain only a 
trace of gelatine in the emulsion to bind the photosensitive silver 
bromide. 

At wavelengths below about 1,800 Å, ultra-violet radiation is
absorbed in air and absorption in quartz is considerable. The
spectrograph therefore has to be evacuated. In vacuum spectrography,
fluorite prisms and optics are used down to 1,200 Å; for shorter
wavelengths than this, ultra-violet absorption in fluorite is
pronounced so reflection-type diffraction gratings are used. With
Schumann or Q emulsions in a vacuum grating spectrograph, records at
wavelengths down to a few Ångström units (where the ultra-violet
region overlaps the soft X-ray region in the electromagnetic
spectrum) are possible.

Fig. 5.5. Spectral response curve of a typical panchromatic photographic emulsion
(a grating spectrometer was used with a tungsten lamp source at 2,850°K).
Horizontal axis: wavelength (Å), marked from 3,500 to 6,500.

The light sensitivity of photographic emulsions can be extended 
into the infra-red region of the spectrum by treating the emulsion 
with photosensitive dyes. Since 1930, panchromatic emulsions sen- 
sitive to wavelengths up to 14,000 A have been made by incorporat- 
ing polycarbocyanine dyes in the emulsion. 



5.11 Photoelectric Detectors 

The various photoelectric devices whose characteristics (including 
spectral response) are described in Section 6.18 of Volume 1 are 
all usable in spectroscopy. The semiconductor devices (the
photodiode and the phototransistor) are tending more and more to
replace the vacuum photocell in spectroscopy and
spectrophotometry.



5.12 Use of Fluorescence 

Many substances fluoresce when exposed to ultra-violet radiation. 
The light emitted is within the visible spectrum. A few examples are 
given in Table 5.2. Cadmium sulphide is strongly phosphorescent. 
After illumination with ultra-violet light, it emits a pale blue glow 
for about an hour. 

Table 5.2.

SOME FLUORESCENT MATERIALS

Substance               Actual colour    Colour of fluorescent light emitted

Quinine sulphate        colourless       blue
Fluorspar               colourless       violet
Fluorescein             brown            green
Vaseline                yellow           green
Barium platinocyanide   yellow           green
Uranium oxide           yellow           green
Chlorophyll             green            red



5.13 The Thermopile 

Thermal detectors, unlike photographic plates and photocells, 
are non-selective; they absorb radiation over the entire spectral 
range from the ultra-violet to the infra-red. This is because the 
receiver of radiation is a black-body. 

A useful type of thermopile in spectroscopy is that due to Schwarz 
(Fig. 5.6). This consists of a number of thermojunctions [between 
bismuth and silver in Fig. 5.6(a), but semiconductor materials are 
used in recent designs] in which alternate junctions are arranged 
in line, the line of hot junctions being behind a slit. In this linear
thermopile, the hot junctions, usually of copper foil to which the
bismuth and silver are soldered, are blackened. Radiation incident
upon the slit is absorbed by this line of hot junctions. The
'cold' junctions, also of foils to which silver and bismuth are 
connected, are shielded from the incident radiation and sufficiently 
separated from the hot junctions to ensure that they remain at 
constant temperature. 



Fig. 5.6. Schwarz linear thermopile: (a) bismuth-silver thermojunctions on blackened copper foils, on which radiation is incident through slit S, with leads to the galvanometer; (b) cylindrical metal tube around the evacuated glass tube containing the thermopile, with the entrance slit (window of glass, quartz, fluorite or potassium bromide)



Such linear thermopiles are mounted in air or in a vacuum. 
The vacuum linear thermopile is usually chosen for infra-red 
spectroscopy as it is much less susceptible to the effects of external 
radiation and much more sensitive than the air thermopile; how- 
ever, it is less rapid in response. 

The vacuum linear thermopile is mounted within an evacuated 
sealed-off glass tube with metal-to-glass sealed lead-in wires. This 
glass tube has in its wall opposite the line of hot junctions a narrow 
window of glass, quartz, fluorite, potassium bromide, caesium 
bromide or KRS 5 (see page 204) depending on the wavelength 
range in the radiation to be measured. The glass tube is within a 
metal tube with a slit opposite the window [Fig. 5.6(b)]. 

Typical of commercial vacuum linear thermopiles is the FT16
Schwarz type (Hilger and Watts Ltd). Each element has an area of
9.0 mm × 0.5 mm, the series of hot junctions comprising one
element (being behind the window and slit) and the cold element
being shielded from radiation. Each element has a resistance of
200 Ω and has separate connecting leads so that connection of
the elements may be either in series or in parallel opposition.
The response time is 0.1 s, and the sensitivity is 25 μV μW⁻¹
(i.e. an output e.m.f. of 25 μV is produced for an incident radiation
of 1 μW). Because the hot junction is blackened, this sensitivity is
independent of wavelength. The calibrated linear thermopile thus
forms an absolute instrument in that the incident radiation is
measured directly in microwatts.
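
As a small illustration of the thermopile used as an absolute instrument, the quoted sensitivity converts a measured e.m.f. directly into incident power. The function name and the sample readings are hypothetical; only the figure of 25 μV per μW comes from the text.

```python
SENSITIVITY_UV_PER_UW = 25.0  # quoted sensitivity: microvolts per microwatt


def incident_power_uw(emf_uv: float) -> float:
    """Incident radiant power (microwatts) for a measured e.m.f. (microvolts)."""
    return emf_uv / SENSITIVITY_UV_PER_UW


if __name__ == "__main__":
    # Hypothetical galvanometer readings, in microvolts.
    for emf in (5.0, 25.0, 60.0):
        print(f"{emf:5.1f} uV  ->  {incident_power_uw(emf):.2f} uW")
```

Since the sensitivity is independent of wavelength, the same conversion applies at every setting of the spectrometer.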

5.14 The Bolometer 

The bolometer consists of two strips of metal, each about 5 × 10⁻⁵
cm thick and both blackened. One strip is exposed to the radiation,
and the other is shielded from the radiation. These strips form two
arms of a Wheatstone bridge. The sensitivity depends on the p.d.
applied across the bridge and on the current sensitivity of the
galvanometer. To avoid Joule heating, the p.d. applied must not
cause too great a current to pass through the strips. Provided the
temperature difference between the strips is small, the
galvanometer deflection is proportional to the intensity of the
radiation. A temperature change of 0.01°C can be measured.
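
The proportionality between bridge output and a small change in strip resistance can be sketched as follows. This is a simplified model under stated assumptions: the two strips are taken as adjacent arms of the bridge, the other two arms are equal, and the open-circuit out-of-balance voltage stands in for the galvanometer deflection. The resistance and supply values are hypothetical.

```python
def bridge_output_v(supply_v: float, strip_ohm: float, delta_ohm: float) -> float:
    """Open-circuit out-of-balance voltage of the bolometer bridge.

    The exposed strip (strip_ohm + delta_ohm) and the shielded strip
    (strip_ohm) form one potential divider; two equal ratio arms form
    the other, whose mid-point sits at half the supply voltage.
    """
    exposed_ohm = strip_ohm + delta_ohm
    v_strip_divider = supply_v * exposed_ohm / (exposed_ohm + strip_ohm)
    v_ratio_divider = supply_v / 2.0
    return v_strip_divider - v_ratio_divider


if __name__ == "__main__":
    # Hypothetical values: 2 V supply, 10 ohm strips.
    for delta in (0.001, 0.002, 0.004):
        print(f"dR = {delta:.3f} ohm -> {bridge_output_v(2.0, 10.0, delta) * 1e6:.1f} uV")
```

For small changes the output is close to V ΔR / 4R, so doubling the resistance change (and hence, for small temperature differences, the radiation intensity) very nearly doubles the output.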

Galvanometer amplifiers may be used, but they are limited 
because the magnitude of the thermoelectric current due to the 
small amount of energy in a single spectral line may be comparable 
with currents generated by motions of the galvanometer coil 
caused by Brownian motion. 

5.15 The Thermal Golay Cell 

The Golay cell, which utilises the expansion of a gas, is remark- 
ably sensitive and it has become much used in recent work in 
infra-red spectroscopy, especially in the far infra-red (Section 5.36). 

Radiation passes through a window transparent to infra-red 
radiation into a small cell filled with gas and containing an absor- 
bing film of a few square millimetres in area. Heat is transferred 
to the gas in an amount depending on the modulation of the inci- 
dent radiation by an interposed rotating sector S, which is motor 
driven. Increase of temperature of the gas causes a change in 
pressure which moves the flexible mirror forming the back of the 
cell. (Fig. 5.7) 

The mirror is part of an optical system comprising a lamp, lens
and line grating. With the optical set-up shown in Fig. 5.7, the
grating is adjusted so that its image in the mirror coincides with
the grating itself. On lateral movement of the grating, dark patches
may be made to coincide with light portions of the grating so that
the light cannot reach the photocell P via the plane mirror M.
When the flexible mirror forming the back of the cell is bent as a
consequence of a gas pressure increase in the cell, the grating image
is moved relative to the grating itself so light reaches the photocell P.
Therefore, the output of photocell P feeds to the electronic
amplifier A an alternating p.d. of frequency equal to that of the rotating
sector S, and of amplitude dependent on the extent to which the
Golay cell mirror is flexed. The amplifier A is designed for this
frequency. Any variation in p.d. applied to the lamp L, which
illuminates the grating, will affect the photocell output and so
spoil the accuracy of relative measurements. To avoid this, a second
lamp L₁ is used on the same supply as L, but its light is chopped
by the sector S. This light is incident on a second photocell P₁,
and a comparison circuit ensures that the instrument response is
independent of supply fluctuations.

Fig. 5.7. Use of a Golay cell (labels include: flexible mirror; line grating)

The maximum response time is 30 ms, and the radiation intensity
measurable is as small as 6 × 10⁻¹¹ W.



5.16 Spectrometers and Monochromators 

In spectroscopy, the essential requirement is the dispersion into a
spectrum of the radiation from a source. This dispersion is achieved
either by a prism or a diffraction grating. To achieve greater
resolution, an interferometer is used, generally in conjunction with
the spectroscope. With regard to the use of a prism or of a grating
to form a spectrum, equations for angular dispersion and chromatic
resolving power already developed in Chapter 12 of Volume 1
are important. For convenience, these results are repeated here
and further explanation given.

A prism is of material of refractive index n which varies with
the wavelength λ in accordance with the Cauchy equation

$$n = A + B\lambda^{-2} \tag{5.1}$$

where A and B are constants for a given material.

The angular dispersion of a prism is the change dθ of the angle of
deviation consequent upon a small change dλ in the wavelength λ.
It is therefore dθ/dλ. For a prism at minimum deviation and where
the whole of the prism face is illuminated uniformly by light
from a collimator, it is shown in Section 12.5 of Volume 1 that
the angular dispersion is given by

$$\frac{d\theta}{d\lambda} = -\frac{2bB}{a\lambda^{3}} \tag{5.2}$$

where b is the length of the base of an isosceles prism and a is
the width of the beam of incident collimated light. The angular
dispersion therefore increases with the length of the prism base
for a given value of a and increases with decrease of wavelength
(so is greater at the blue end of the spectrum). Also, with a given
width of the incident collimated beam, the angular dispersion is
doubled if two identical prisms are used in series. Note also that
the presence of the negative sign in Equation 5.2 is indicative of the
fact that the angle θ increases as λ decreases.

The chromatic resolving power of a prism, defined as λ/δλ where
δλ is the smallest change of wavelength discernible in accordance
with the Rayleigh criterion at a mean wavelength λ, is given by
the equation

$$\frac{\lambda}{\delta\lambda} = -b\,\frac{dn}{d\lambda} = \frac{2bB}{\lambda^{3}} \tag{5.3}$$

where again the whole of the face of an isosceles prism is illuminated
uniformly by light incident from a collimator. The chromatic
resolving power therefore increases as the wavelength decreases
and also increases with length of the prism base.
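
The prism results can be put into numbers with a short sketch. It takes Equations 5.1 to 5.3 in the forms n = A + Bλ⁻², dθ/dλ = −2bB/(aλ³) and λ/δλ = 2bB/λ³. The Cauchy constants, prism base and beam width below are hypothetical values chosen only to be of a plausible order for optical glass.

```python
def cauchy_index(a_const: float, b_const_m2: float, wavelength_m: float) -> float:
    """Refractive index from the Cauchy equation n = A + B / lambda^2."""
    return a_const + b_const_m2 / wavelength_m ** 2


def prism_angular_dispersion(base_m: float, beam_width_m: float,
                             b_const_m2: float, wavelength_m: float) -> float:
    """d(theta)/d(lambda) in radians per metre for a prism at minimum deviation."""
    return -2.0 * base_m * b_const_m2 / (beam_width_m * wavelength_m ** 3)


def prism_resolving_power(base_m: float, b_const_m2: float,
                          wavelength_m: float) -> float:
    """Chromatic resolving power lambda / delta(lambda) of a fully illuminated prism."""
    return 2.0 * base_m * b_const_m2 / wavelength_m ** 3


if __name__ == "__main__":
    # Hypothetical glass: A = 1.5, B = 5.0e-15 m^2; 5 cm base, 3 cm beam width.
    A, B, BASE, BEAM = 1.5, 5.0e-15, 0.05, 0.03
    lam = 589.3e-9  # mean wavelength of the sodium doublet, metres
    power = prism_resolving_power(BASE, B, lam)
    print(f"n = {cauchy_index(A, B, lam):.4f}")
    print(f"dispersion = {prism_angular_dispersion(BASE, BEAM, B, lam):.3e} rad/m")
    print(f"resolving power = {power:.0f}")
    print(f"smallest resolvable interval = {lam / power * 1e10:.2f} angstroms")
```

With these assumed values the smallest resolvable interval comes out at a few ångströms, so a prism of this size would just separate the sodium doublet, in keeping with the remark in Section 5.8 about small prism spectrometers.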

The angular dispersion of a plane diffraction grating is the change
dθ of the angle of diffraction consequent upon a small change
dλ in the wavelength λ. Equation 12.14 of Volume 1 shows that this
angular dispersion is given by

$$\frac{d\theta}{d\lambda} = \frac{p}{e\cos\theta} \tag{5.4}$$

where p is the integral order of the spectrum, and e is the separation
between corresponding points on neighbouring rulings on the
grating. The angular dispersion of a grating hence increases with
the order p; thus in the second order it is double that in the first,
and it increases with the number of grating rulings per centimetre
(or 1/e).

As regards the variation of dθ/dλ with λ, this will occur because
cos θ varies with wavelength. With normal incidence, θ is greater
for the longer wavelengths so the dispersion is greater in the red
end of the spectrum, which is opposite to the state of affairs
obtained with a prism.

In several cases of practical interest, θ is near 0°, so cos θ ≈ 1.
The angular dispersion is then almost independent of wavelength:
thus a normal spectrum with a linear scale of wavelengths is
obtained.

The chromatic resolving power R of a diffraction grating is shown
in Section 12.6 of Volume 1 to be given by the equation

$$R = \frac{\lambda}{\delta\lambda} = pN \tag{5.5}$$

It therefore increases with the total number of rulings N and the
order p; so in the third order, say, it is three times what it would
be in the first order.
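
The grating results may be illustrated in the same way. The sketch below takes the angular dispersion as dθ/dλ = p/(e cos θ) and the resolving power as R = pN, and uses the grating equation for normal incidence, e sin θ = pλ, to find θ. The ruling density of 6,000 lines per centimetre is a hypothetical value.

```python
import math


def diffraction_angle_rad(order: int, spacing_m: float, wavelength_m: float) -> float:
    """Angle of the principal maximum from e sin(theta) = p lambda (normal incidence)."""
    return math.asin(order * wavelength_m / spacing_m)


def grating_angular_dispersion(order: int, spacing_m: float,
                               wavelength_m: float) -> float:
    """d(theta)/d(lambda) = p / (e cos(theta)), in radians per metre."""
    theta = diffraction_angle_rad(order, spacing_m, wavelength_m)
    return order / (spacing_m * math.cos(theta))


def grating_resolving_power(order: int, total_rulings: int) -> int:
    """Chromatic resolving power R = p N."""
    return order * total_rulings


def rulings_needed(wavelength: float, interval: float, order: int) -> int:
    """Smallest number of rulings N for which p N reaches lambda / delta(lambda)."""
    return math.ceil(wavelength / (interval * order))


if __name__ == "__main__":
    spacing = 1e-2 / 6000  # hypothetical grating: 6,000 rulings per centimetre
    lam = 589.3e-9         # mean wavelength of the sodium doublet, metres
    for p in (1, 2):
        print(f"order {p}: dispersion {grating_angular_dispersion(p, spacing, lam):.3e} rad/m")
    # Sodium doublet: mean wavelength 5,893 angstroms, separation about 6 angstroms.
    print("rulings needed in first order:", rulings_needed(5893.0, 6.0, 1))
```

The second-order dispersion comes out somewhat more than double the first-order value, because cos θ is smaller at the larger diffraction angle; the simple doubling holds only when θ is small.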

A spectroscope in which, by direct or indirect calibration, indi- 
vidual wavelengths in the spectrum are measured is a spectrometer. 
If the spectrum can be permanently recorded by photography — a 
specially designed camera is attached to the instrument which 
takes the place of the telescope — the instrument is called a spectro- 
graph. Frequently, a given instrument can be used as either a spec- 
trometer or a spectrograph. 

After dispersion into a spectrum, the instrument may be arranged
or designed so that only a narrow band of wavelengths emerges
from its exit slit: the instrument is then known as a monochromator.
Generally, therefore, in a spectrometer or a spectrograph the object 
is to display the whole or a significant part of the spectrum, whereas 
in the monochromator the output is restricted to a narrow wave- 
length band (usually with facilities for selection of the mean wave- 
length desired). An instrument which is basically a spectrometer 
can be modified for use as a monochromator. 

The same principles apply in the ultra-violet, visible and
infra-red regions of the spectrum, but the techniques adopted for the
various regions will differ as they have to take into account the
problems of absorption and detection of the radiation in the
instrument.



5.17 The Constant-deviation Spectrometer 

A development from the simple prism spectrometer is the constant- 
deviation instrument. In the simple spectrometer, the telescope is 
moved so that various parts of the spectrum can be observed. In the 
constant-deviation instrument, a special prism is used: dispersion 
is obtained with the collimator and telescope permanently fixed at 
right angles to one another, and the prism is rotated to observe 
various parts of the spectrum. 

The constant-deviation prism (Fig. 5.8) is a single specially cut,
ground and polished block of optical glass which is, in effect, three 
prisms ABC (30°, 60°, 90°), BCD (45°, 45°, 90°) and ADE (30°, 60°, 
90°). The incident light from the collimator meets face AB at F, is 
refracted along FG, internally reflected at G at face BD to reach H 
where it emerges after refraction to the telescope. 

For a particular wavelength X, the image of the collimator slit is 
set on the cross-wires of the telescope. As the emergent beam HK 
has to be perpendicular to the incident beam IF because of the 
fixed collimator and telescope positions, the prism has to be rotated 
so that slit images for various values of A are made to coincide with 
the cross-wires of the telescope eyepiece. 

Within the 45° prism BCD, incidence along FG needs to be normal
to face BC and emergence along GH normal to face CD. There
will then be no refraction within prism BCD, only internal reflection
through 90° at G. The refraction and dispersion therefore occur
only in prisms ABC and ADE. With the conditions prevailing, the
angle of incidence θ will be equal and opposite to the angle of
emergence φ. The two refractions then cause no total deviation; all the
deviation is due to reflection at G. The dispersions within prisms
ABC and ADE will, however, be in the same sense.

As the angle of incidence θ and the angle of emergence φ are
equal, the arrangement is equivalent to that of a 60° prism at
minimum deviation. This deviation is effectively zero owing to
refraction and 90° owing to reflection: it is constant irrespective of
wavelength.

Referring to Fig. 5.8, let n be the refractive index of the material
of the prism at a wavelength λ. Then n = (sin θ)/(sin R). As ∠BFG
is clearly 60°, angle R must be 30°. Therefore

$$\sin\theta = n\sin R = 0.5n$$



Fig. 5.8. The constant-deviation prism (labels: incident light from collimator; emergent light to telescope)



If at wavelength λ the refractive index n is 1.6412, θ = sin⁻¹ 0.8206
= 55° approx. As n will depend upon wavelength, to bring the
image of the collimator slit to coincide with the telescope eyepiece
cross-wires, θ will need to be varied by small rotations imparted to
the prism.
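
The relation sin θ = 0.5n gives the angle of incidence required for any wavelength once the refractive index is known. A minimal sketch follows; the function name is illustrative, and the second index value is hypothetical, chosen to show how the required rotation changes towards shorter wavelengths.

```python
import math


def incidence_angle_deg(refractive_index: float) -> float:
    """Angle of incidence (degrees) satisfying sin(theta) = n sin(30 deg) = 0.5 n."""
    if not 0.0 < refractive_index <= 2.0:
        raise ValueError("sin(theta) = 0.5 n has no solution for this index")
    return math.degrees(math.asin(0.5 * refractive_index))


if __name__ == "__main__":
    theta_text = incidence_angle_deg(1.6412)   # index quoted in the text
    theta_blue = incidence_angle_deg(1.6600)   # hypothetical index at a shorter wavelength
    print(f"theta = {theta_text:.2f} deg for n = 1.6412")
    print(f"rotation needed for n = 1.6600: {theta_blue - theta_text:.2f} deg")
```

The change in θ between two wavelengths is the rotation that the calibrated drum must impart to the prism.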

This rotation is imparted by a screw drive operated by turning 
a drum which is directly calibrated in wavelengths (Fig. 5.9). Rota- 
tion of this drum to a given position then enables light of a partic- 
ular wavelength to be observed in the telescope. If, instead of the 
eyepiece, a slit is placed at the focus of the telescope objective, 
a constant-deviation monochromator is available. The telescope can 
alternatively be replaced by a camera so that the dispersed spectrum 
is photographed, giving a constant-deviation spectrograph. 



Fig. 5.9. The constant-deviation spectrometer (labels: source; constant-deviation prism on prism table; drum calibrated in wavelengths; eyepiece)

Commercial constant-deviation spectrometers are able in the 
visible region of the spectrum to achieve an accuracy of 1-3 A in 
wavelength. 

To obtain greater linear dispersion, a Littrow type of spectrom- 
eter or a grating spectrometer is used. 



5.18 The Littrow Prism Spectrometer

The Littrow prism spectrometer can be adapted for use in the ultra- 
violet, visible and infra-red regions. Vacuum spectrographs of this 
type for use in the ultra-violet at wavelengths below 1,200 A are 
also made. 

The Littrow prism spectrometer is a double-pass instrument: 
dispersion is obtained by a triangular prism which is traversed twice 
by the light, which is thus doubly dispersed. It can be provided
with a wavelength-calibrated drive which rotates the prism and can 
also be arranged so that it produces a spectrum on a photographic 
plate. 

For use in the ultra-violet down to 2,000 Å approx., a quartz
prism is necessary. When light traverses quartz, which is optically
active, for each incident ray two circularly polarised light rays
emerge corresponding to slightly different refractive indices. In
the Cornu prism (Fig. 5.10), this difficulty is overcome by making
one half of the prism of right-handed quartz and the other half of
left-handed. A single ray then emerges, because the phase difference
introduced in the first half (prism XYZ) is balanced out by the equal
and opposite phase difference introduced in the second half
(prism ZYW). In the double-pass instrument of the Littrow type
used in the ultra-violet, an ordinary quartz prism is satisfactory
and the Cornu model is not needed because the phase difference
introduced when the light traverses the prism in the forward
direction is balanced out when the reflected light traverses the prism in
the backward direction.

Fig. 5.10. A Cornu prism

Fig. 5.11. A Littrow spectrometer (schematic plan view; labels include: source; screw drive)

The light from the collimator lens L (Fig. 5.11) traverses the
prism P of angle a to emerge after deviation and dispersion to
a front-surface aluminised plane mirror M at which it is incident
normally. The light is therefore reflected back to traverse the prism
again, so that it is additionally dispersed, to be focused at the
eyepiece, exit slit or a photographic plate. Lens L acts as both the
collimator lens and the telescope or camera objective.

The entrance slit clearly needs to be outside the path of the light 
returning to the exit; it is therefore to one side of this path with 
a right-angled prism Q arranged to determine the path of the inci- 
dent light to the dispersing prism P. The mirror M is slightly tilted 
so that the returning rays pass over the prism Q. 

The prism table can be rotated by means of the wavelength 
drum D and the screw drive. ABC is a mechanical device which 
automatically keeps the prism in a position of minimum deviation 
for light of wavelengths which are focused at the eyepiece cross- 
wires, the exit slit or the centre of the photographic plate. The ties 
B and C to pivots H and K form mechanical links whereby the 
mirror M is rotated through twice the angle of rotation of the prism
when the drum D is turned. 

The plano-convex achromatic lens L is of glass in the visible 
spectrum or of quartz and fluorite in the ultra-violet. In some de- 
signs, however, L is replaced by an aluminised concave mirror. 
The advantage of the mirror is that it does not introduce
chromatism and, unlike a lens, does not absorb radiation. A
monochromator (exit slit) or a spectrograph (photographic plate at
the exit) made in this way, with a quartz prism and a concave mirror
instead of lens L, is able to cover the wavelength range from
35,000 Å to 2,000 Å. The optical paths are necessarily different with
a mirror in place of lens L.

In a modification of the Littrow system, the right-handed component
only of the Cornu prism is used. Component XYZ (Fig. 5.10)
is used, and the face YZ is aluminised. Light entering the single 
right-handed component is then reflected back at face YZ. Then the 
clockwise circularly polarised vibrations incident upon YZ become 
anticlockwise components after reflection. These two components 
correspond to slightly different refractive indices, but the phase 
difference introduced in the incident path in the prism is balanced 
out by the opposite phase difference in the reflected path. 

A monochromator based on the Littrow system, having a quartz 
prism and concave mirror focusing and covering the wavelength 
range 35,000-2,000 A is often furnished with a Schwarz linear 
thermopile at the exit slit for recording in the infra-red and visible. 
In the ultra-violet and visible, this thermopile can be replaced by
a small photomultiplier cell to obtain greater sensitivity, but at the
sacrifice of a response that is independent of wavelength.

To explore longer wavelengths in the infra-red than 35,000 A, 
the quartz prism must be replaced by one of a different material. 
The choice is indicated by Table 5.3. All these materials are
hygroscopic and so their surfaces must be protected with a thin
lacquer coating. A material known as KRS 5, which is 44% thallium
bromide and 56% thallium iodide, is non-hygroscopic and can be used
in the range 3-40 μm.

Table 5.3.

CHOICE OF PRISM MATERIAL IN THE INFRA-RED

Material              Long wavelength limit (μm)

Lithium fluoride       6.0
Calcium fluoride       9.0
Sodium chloride       15.0
Potassium bromide     25.0
Caesium bromide       40.0
Caesium iodide        50.0

To explore wavelengths in the ultra-violet below 2,000 Å, the
quartz prism is replaced by a fluorite one, which can be used for
wavelengths down to 1,200 Å. Below 1,800 Å, the spectrometer would
have to be evacuated to avoid undue absorption of the ultra-violet
in air at atmospheric pressure.



5.19 The Grating Spectrometer 

For a plane diffraction grating at which light of wavelength $\lambda$ is
incident normally, the equation

$$e \sin\theta = p\lambda \tag{5.6}$$

gives the angles of diffraction $\theta$ at which principal maxima occur
in the transmitted light for various orders $p = 1, 2, 3, \ldots$, where $e$ is
the separation between corresponding points on adjacent rulings in
the grating (Section 11.6 of Volume 1). Each slit will produce a
diffraction pattern in which the intensity distribution depends on the
slit width. This combination of interference and diffraction effects
has been considered analytically in Volume 1 (Section 11.6) but
a restatement of results is useful here.

Suppose collimated light of wavelength $\lambda$ is incident normally on
a grating of parallel equidistant slits, each of width $a$ much less
than that of the opaque portions between slits. The separation
between corresponding points on adjacent slits is $e$. The diffracted
light is brought to a focus by a lens or the eye. The treatment of
diffraction at a single slit gives the expression
$R_\theta = (R_0 \sin\alpha)/\alpha$, where $R_\theta$ is the amplitude of the
light vibration emerging in a direction at an angle $\theta$ to the
incident direction, $R_0$ is the amplitude for $\theta = 0$, and
$\alpha = (\pi a \sin\theta)/\lambda$. Therefore

$$R_\theta = R_0\,\frac{\sin[(\pi a \sin\theta)/\lambda]}{(\pi a \sin\theta)/\lambda}$$




For $N$ slits, the common phase difference between neighbouring
clear slits is $(2\pi e \sin\theta)/\lambda$ and the resultant intensity $I$ in the
direction $\theta$ is decided by

$$I = k\,\frac{\sin^2[(N\pi e \sin\theta)/\lambda]}{\sin^2[(\pi e \sin\theta)/\lambda]}$$

where $k$ is a constant. When

$$\frac{\pi e \sin\theta}{\lambda} = p\pi$$

the expression for $I$ is indeterminate because the denominator is
zero; the amplitude is therefore found by differentiating both
numerator and denominator with respect to $\theta$. Now

$$\frac{d}{d\theta}\left(\sin\frac{N\pi e \sin\theta}{\lambda}\right) = \frac{N\pi e}{\lambda}\cos\frac{N\pi e \sin\theta}{\lambda}\,\cos\theta$$

and

$$\frac{d}{d\theta}\left(\sin\frac{\pi e \sin\theta}{\lambda}\right) = \frac{\pi e}{\lambda}\cos\frac{\pi e \sin\theta}{\lambda}\,\cos\theta$$

Therefore the amplitude in direction $\theta$ is proportional to

$$\frac{N\cos[(N\pi e \sin\theta)/\lambda]}{\cos[(\pi e \sin\theta)/\lambda]} = \frac{N\cos Np\pi}{\cos p\pi} = \pm N$$

Hence the intensity $I$ is proportional to $N^2$.
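The limiting value found above by differentiation can also be checked numerically. The following is a minimal sketch, not part of the original treatment: the function name `interference_factor` and the choice of 20 slits are illustrative assumptions, and `x` stands for the quantity (π e sin θ)/λ.

```python
import math

def interference_factor(n_slits, x):
    """Return sin^2(N x) / sin^2(x), where x = (pi e sin(theta)) / lambda.

    Where sin(x) vanishes (x = p*pi) the ratio is indeterminate, so the
    limiting value N^2 obtained by differentiation is returned instead.
    """
    s = math.sin(x)
    if abs(s) < 1e-12:
        return float(n_slits * n_slits)
    return (math.sin(n_slits * x) / s) ** 2

N = 20  # illustrative number of slits (an assumption, not from the text)
for p in (0, 1, 2):
    x = p * math.pi + 1e-6  # just to one side of the p-th principal maximum
    print(p, interference_factor(N, x))  # each value lies very close to N**2
```

Evaluating the ratio slightly off each principal maximum gives values that approach N² as the offset shrinks, in agreement with the result of the differentiation.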

The resultant intensity in a direction $\theta$ is the consequence of
both diffraction and interference terms. Therefore the intensity in
direction $\theta$ is proportional to

$$\frac{\sin^2[(\pi a \sin\theta)/\lambda]}{[(\pi a \sin\theta)/\lambda]^2}\cdot\frac{\sin^2[(N\pi e \sin\theta)/\lambda]}{\sin^2[(\pi e \sin\theta)/\lambda]}$$

The variation of the second factor, the interference term, is such
that maxima occur when $e \sin\theta = p\lambda$. This value can be substituted
in the first factor, the diffraction term, to give the resultant
intensities of the maxima as proportional to

$$\frac{\sin^2(\pi a p/e)}{(\pi a p/e)^2}$$

where $p = 0, 1, 2, \ldots$ Calculation for $a = 0.5e$ and $a = 0.1e$ gives
the results shown in Table 5.4. Both amplitudes and intensities are
normalised to unity at $p = 0$. Note that when $a = 0.5e$, the second,
fourth, sixth, ... orders of spectra disappear. When $a = 0.1e$,
the tenth order disappears, also the twentieth, thirtieth, and so on.

Table 5.4.

AMPLITUDE AND INTENSITY CALCULATIONS FOR a = 0.5e AND a = 0.1e

The amplitude in order p is decided by sin(pπa/e)/(pπa/e), which
takes the value 1 at p = 0.

                        p = 0   p = 1   p = 2   p = 3   p = 4   p = 5

a = 0.5e   Amplitude    1       0.63    0       -0.21   0       0.13
           Intensity    1       0.40    0       0.04    0       0.017

a = 0.1e   Amplitude    1       0.975   0.925   0.852   0.752   0.64
           Intensity    1       0.95    0.86    0.73    0.57    0.41

Diffraction gratings used in spectroscopy seldom have fewer than
3,000 lines per centimetre. With this number of lines, $e = 1/3000$ cm
and Equation 5.6 gives

$$\frac{\sin\theta}{3000} = p\lambda$$

In the visible spectrum, the minimum value of $\lambda$ is $4\times10^{-5}$ cm
and $\sin\theta$ can approach unity. The maximum possible order that can
be observed in transmission is therefore

$$p = \frac{1}{3\times10^{3}\times4\times10^{-5}} = 8.3$$

Therefore $p = 8$, to the nearest smaller integer. With $a = 0.1e$, the
first missing order is the tenth. A grating with 3,000 lines per
centimetre and $a = 0.1e$ would therefore not have missing orders.
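The two calculations of this section, the relative strength of each order and the highest observable order, can be sketched in a few lines. This is an illustrative sketch only: the function names are assumptions, the amplitude is taken as sin(pπa/e)/(pπa/e) normalised to unity at p = 0, and the highest order is taken as the largest integer p for which Equation 5.6 keeps sin θ at or below unity. Values computed in this way may differ from those tabulated in Table 5.4 in the second decimal place.

```python
import math

def order_amplitude(p, a_over_e):
    """Normalised amplitude sin(x)/x with x = pi * p * a/e (1 at p = 0)."""
    x = math.pi * p * a_over_e
    return 1.0 if x == 0 else math.sin(x) / x

def max_order(lines_per_cm, wavelength_cm):
    """Largest integer p with p * wavelength <= e, so that sin(theta) <= 1."""
    e = 1.0 / lines_per_cm  # separation e in centimetres
    return math.floor(e / wavelength_cm)

# Relative intensities of orders 0..5 for the two slit widths in the text.
for ratio in (0.5, 0.1):
    intensities = [order_amplitude(p, ratio) ** 2 for p in range(6)]
    print("a =", ratio, "e:", [round(v, 3) for v in intensities])

# Highest order for 3,000 lines per centimetre at 4e-5 cm (violet).
print(max_order(3000, 4e-5))
```

The even orders for a = 0.5e come out as zero to within rounding error, and the tenth order for a = 0.1e likewise, which is the missing-order behaviour described above.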



5.20 Plane Grating in the Infra-red and the Ultra-violet 

Glass absorbs radiation of wavelengths greater than about 3 μm
(30,000 Å) and less than about 0.3 μm (3,000 Å). Although a quartz
transmission grating could be used to extend this range, the best
procedure is to employ a reflection grating. The reflection grating
can be ruled on polished stainless steel or speculum alloy, but the
usual practice is to make the ruling on optically polished glass and
then coat it with an opaque layer of aluminium by deposition in
a vacuum.

Fig. 5.12. The plane reflection grating (labels: diffracted image
formed by a lens; direction of specular reflection)

In Fig. 5.12(a) suppose collimated light of wavelength $\lambda$ is
incident at an angle $\theta$ on a reflection grating consisting of regularly
spaced reflecting strips parallel to one another. Corresponding
points on neighbouring strips are separated by $e$.

Consider light reflected from neighbouring strips C and D in
the grating AB at an angle $\theta - \alpha$ to the normal. Constructive
interference occurs between these neighbouring parallel rays (when
they are brought to a focus by a lens or by the eye) when, in Fig.
5.12(b),

$$ED - CF = p\lambda$$

where $p$ is an integer, CE is perpendicular to ED and so is a wavefront
in the incident beam, and DF is perpendicular to CF and thus forms
a wavefront in the reflected beam. Therefore

$$CD\sin\theta - CD\sin(\theta - \alpha) = p\lambda$$

i.e.

$$e\,[\sin\theta - \sin(\theta - \alpha)] = p\lambda \tag{5.7}$$






The general condition when light is diffracted on both sides of the
specularly reflected image is

$$e\,[\sin\theta \pm \sin(\theta - \alpha)] = p\lambda \tag{5.8}$$



In recent developments, the rulings in the glass, which is
subsequently aluminised, are cut at such an angle that most of the
diffracted light is concentrated into one or two orders. Such gratings
are called blazed gratings. Whereas small plane gratings of both the
transmission and reflection types are usually plastic replicas from a
master ruled grating and are mounted on a spectrometer table for
students' experiments, the large plane grating of the blazed type is
mounted in a spectrometer, monochromator or spectrograph of the
Littrow pattern (Fig. 5.13). The quartz-fluorite achromatic lens L is
used as both the collimator and camera lens in a Littrow spectrograph;
the reflecting grating G is set at the appropriate angle, with
appropriately angled blazed rulings, to reflect the diffracted rays in
the spectrum back to the lens L, which focuses them at the
photographic plate P. An auxiliary small quartz prism Q is used to
enable the slit illuminated by the source to be placed to one side of
the main light paths.

Fig. 5.13. Littrow plane grating spectrograph



5.21 The Rowland Concave Grating 

The use of a lens in the collimator and telescope or camera inevi- 
tably introduces absorption of radiation in the ultra-violet and infra- 
red regions. A valuable procedure, introduced by Rowland, is to 
rule the grating on a concave surface so that self-focusing can be 
achieved and lenses can be avoided. 

The original Rowland grating was ruled on a concave mirror of 
speculum alloy. In present-day technique, the ruling is made on 
a concave polished glass surface, the diamond point being set to 
give the required blaze angle, and the ruling is aluminised. The ruled
lines are equally spaced along a chord and not an arc. The grating
is mounted with its rulings parallel to the slit (illuminated by the 
source of radiation), so that diffraction and focusing are achieved 
simultaneously. 

In Fig. 5.14(a), the grating G is greatly exaggerated in size relative
to the circle (called the Rowland circle) for clarity. The radius of
curvature of the grating is equal to the diameter NB of the circle.
The points A, B, and C are on the grating. The circle touches the
grating at its midpoint B and passes through S, the slit illuminated
by the source of radiation. An image of the slit S is formed at P by
reflection in the concave grating. As ABC is, in practice, small
compared with NB, the points A and C on the grating may be taken to
be on the circumference PNSB of the circle.

Besides this reflected image at P, other images due to diffraction
will be formed along the arc PS.

Fig. 5.14. The Rowland concave grating




Light from the slit S reflected to P will appear to come from
the direction MB. The grating can be regarded, in effect, as a
transmission type relative to light along MB. For light to be diffracted
to N, it is seen from Fig. 5.14(b) that

$$e \sin\theta = p\lambda$$

where $e$ is the separation between corresponding points on
neighbouring rulings, and $p$ is an integer.

In large-scale apparatus set up in accordance with Fig. 5.14(a), 
the concave grating would be mounted in a dark room, the slit 
would be illuminated by a shielded source, and photographic 
plates would be mounted along the arc PNS. This method was 
used in early studies of the fine structure of the spectrum of atomic 
hydrogen: NB was 21 ft (6.3 m), and photographic plates were
erected along the arc PNS over a length of 29 ft (8.7 m). The rays
fall obliquely on the plates and form an astigmatic image, so a
line at S at right angles to the slit is not in focus at P.

A grating of 6 in (15 cm) length will disperse the second order
visible spectrum between red and violet over a distance of 12 ft
(3.6 m) with apparatus of these dimensions, and lines 0.03 Å apart
can be separated. From Fig. 5.14(a)

$$e \sin\theta = e \sin\angle PBN = p\lambda$$

The angle NPB is a right angle, so

$$\sin\angle PBN = \frac{NP}{r}$$

where $r$ is the radius of curvature of the concave grating (NB).
Therefore

$$e\,\frac{NP}{r} = p\lambda$$

from which it is seen that the distance NP is proportional to $\lambda$ in
a given order $p$.
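The proportionality between NP and wavelength can be illustrated numerically. In this sketch the radius of 6.3 m is the value of NB quoted above, whereas the figure of 6,000 lines per centimetre and the function name `distance_np` are assumptions chosen purely for illustration.

```python
def distance_np(wavelength_m, order, radius_m, lines_per_cm):
    """Distance NP in metres from the relation e * NP / r = p * lambda."""
    e = 0.01 / lines_per_cm  # ruling separation e in metres
    np_m = order * wavelength_m * radius_m / e
    if np_m > radius_m:
        # sin(angle PBN) = NP / r cannot exceed unity
        raise ValueError("this order is not observable at this wavelength")
    return np_m

# Second order, violet (4,000 angstroms) and red (7,000 angstroms).
for wavelength in (4.0e-7, 7.0e-7):
    print(wavelength, distance_np(wavelength, 2, 6.3, 6000))
```

Doubling the wavelength doubles NP, so a scale laid along NP is linear in wavelength within a given order.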

Suppose $r = 3.5$ m and the grating has 6,000 lines per centimetre.
If the temperature changes by 1 degC, expansion of the
grating will be such that $\delta e/e$ is about $2\times10^{-5}$ per degC, and there
will be a corresponding error in the determination of $\lambda$. Accurate
measurements of wavelengths therefore require a fine control of the
environment.
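The size of this temperature error follows from the fact that, for fixed NP, r and p, the wavelength is proportional to e, so the fractional error in wavelength equals the fractional change in e. A minimal sketch, in which the wavelength of 5,000 Å is an assumed illustrative value and the function name is hypothetical:

```python
def thermal_wavelength_error(wavelength_angstrom, expansion_per_degc, delta_t_degc):
    """Wavelength error in angstroms, using d(lambda)/lambda = de/e."""
    return wavelength_angstrom * expansion_per_degc * delta_t_degc

# Fractional expansion of 2e-5 per degC (from the text), 1 degC change,
# at an assumed wavelength of 5,000 angstroms.
print(thermal_wavelength_error(5000.0, 2e-5, 1.0))
```

An error of about 0.1 Å per degree is larger than the 0.03 Å separation quoted earlier for resolvable lines, which is why the temperature must be held steady.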

Typical values of the dispersion for various sizes of grating and 
line spacing are given in Table 5.5. 



Table 5.5.

DISPERSION VALUES FOR DIFFERENT GRATING SIZES AND LINE SPACINGS

Radius of curvature    Lines per     First order    Second order
of the grating (m)     centimetre    (Å mm⁻¹)       (Å mm⁻¹)

 2.0                    6,000        15.0           7.5
 3.0                   10,000         7.0           3.5
 4.0                   10,000         5.2           2.6
 6.0                    6,000         5.0           2.5
 4.0                   18,000         3.4           1.7
20.0                   12,000         0.8           0.4



5.22 Mountings for Concave Gratings 

Three mountings adapted to the Rowland circle are due to Paschen, 
to Rowland and to Eagle. 



5.22.1 THE PASCHEN MOUNTING 

The grating and slit are fixed, and a photographic plate with thin
glass backing is curved in a mount to fit any desired part of the
Rowland circle. In Fig. 5.15 are shown the approximate positions
occupied by the zero and first three orders in the visible spectrum.
This mounting is used considerably, but it requires a large dark
room with pier mountings for the slit and grating and a specially
erected circular track (the Rowland circle) on which firm mounting
of the photographic plate holders is necessary.

Fig. 5.15. The Paschen mounting (labels: Rowland circle; zero order
(image of slit); first order; second order; third order)

5.22.2 THE ROWLAND MOUNTING 

The concave grating G [Fig. 5.16(a)] and the photographic plate 
holder P are mounted at a distance r apart on a rigid beam, r being 
the radius of curvature of G, equal to the diameter of the Rowland 
circle (shown dotted). As the beam slides on the swivel trucks 
attached below G and P and mounted on the rigid perpendicular 
rails AB and AC, G and P clearly remain at the fixed distance r 
apart and are maintained on the Rowland circle. 

Movement of G as its associated swivel truck is slid along AB
will cause variation of the angle of incidence $i$ of the light from the
illuminated slit S mounted at the fixed junction A. On the other
hand, the angle of diffraction $\theta$, which is the angle between the
diffracted rays reaching P and the normal to G along the axis of
the beam between G and P, stays close to zero.

Fig. 5.16. The Rowland mounting (label: beam on which G and P are
mounted)

In Fig. 5.16(b), let X and Y be adjacent rulings on the concave
grating separated by distance $e$. For light of wavelength $\lambda$, a
maximum at an order of interference $p$ will appear centrally on the
photographic plate P when XP = YP (which are each nearly
equal to $r$) and the path difference between SY and SX equals $p\lambda$.
As $\angle XSP$ is very nearly 90°,

$$SY - SX = YN = e \sin i$$

A maximum will appear at the centre P, therefore, when

$$e \sin i = p\lambda$$

As $\sin i = SP/r$,

$$\lambda = \frac{e\,SP}{p\,r}$$

In a given order $p$, therefore, $\lambda$ is proportional to SP, which implies
that, as S is fixed, the wavelength is very nearly proportional to
the distance from some fiducial mark of the line recorded on the
photographic plate (i.e. the spectrum is normal, to a close
approximation).

As $\sin i = p\lambda/e$, it follows that $p\lambda/e$ cannot exceed unity. From
this it is easily calculated that for a grating with 8,000 lines per
centimetre, the maximum wavelength observable in the first order
is 12,500 Å and $12{,}500/p$ Å in the $p$th order, where $p = 2, 3, 4, \ldots$
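The limit set by the condition that sin i cannot exceed unity is easily tabulated. The sketch below simply evaluates e/p in angstrom units for the 8,000 lines per centimetre grating of the text; the function name is an illustrative assumption.

```python
def max_wavelength_angstrom(lines_per_cm, order):
    """Largest wavelength (angstroms) for which p * lambda / e <= 1."""
    e_angstrom = 1.0e8 / lines_per_cm  # 1 cm = 1e8 angstroms
    return e_angstrom / order

for p in (1, 2, 3, 4):
    print(p, max_wavelength_angstrom(8000, p))
```

The first-order limit reproduces the 12,500 Å quoted above, and each higher order reduces the limit in proportion to 1/p.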

5.22.3 THE EAGLE MOUNTING

A compact mounting similar to that in a Littrow spectrograph is due 
to Eagle ; it is much used in all ranges of the spectrum, but is partic- 
ularly convenient in the ultra-violet at wavelengths below 1,200 A 
(where the container of the spectrograph has to be maintained 
under vacuum); moreover, the compact container involved facili- 
tates temperature control. 

The concave grating G and the curved photographic plate P are 
mounted at a distance apart (which can be varied) on a beam sup- 
ported within the cylindrical container which can be evacuated, if 
required. Both G and P (which is curved) are on a Rowland circle 
(Fig. 5.17). The slit S is to one side of the beam within the container
wall (or the slit and source of radiation are inside a separate attached
container which can be evacuated together with the main chamber).



Fig. 5.17. The Eagle mounting (plan view) (labels: slit S; source;
container; Rowland circle (positions 1 and 2); G (positions 1 and 2);
P (positions 1 and 2))

The quartz prism Q directs radiation from the slit on to the grating 
G. In a vacuum spectrograph, Q is replaced by a plane mirror 
to avoid absorption of ultra-violet radiation. 

The virtual image of the slit S formed by the prism Q (or plane
mirror) is at the centre of the photographic plate P. The grating
can be moved along the beam relative to the plate P. As it is moved,
both G and P are rotated about a vertical axis so that they remain
on a Rowland circle.

The angle of diffraction $\theta$ to the normal to the grating equals
the angle of incidence $i$ in this arrangement, so

$$2e \sin i = p\lambda$$

where $e$ is the separation between corresponding points on
neighbouring rulings and $p$ is an integer. As $\sin i$ has a maximum value
of unity, the maximum wavelength observable is $2e/p$, which is twice
that possible in the Rowland mounting.

The more recent Wadsworth mounting (Fig. 5.18) for a concave 
grating does not make use of the Rowland circle idea. Instead, 
radiation from the slit S is collimated by the aluminised front- 
surface concave mirror M to be incident on the concave grating 
G which can be rotated to vary the angle of incidence. The centre 
of the curved photographic plate P is maintained on the normal to 
the grating G. As G is rotated, therefore, P has to be moved. 



Fig. 5.18. The Wadsworth mounting (plan view)

With the compact Wadsworth set-up, spectrographs of larger 
aperture and improved definition (as a result of reduced aberration 
in the optical system) are possible, leading to more intense sharp 
lines at the plate P with the consequent improvement that exposure 
times required are reduced. 



5.23 High-resolution Spectrometry 

A spectrometer or monochromator based on the use of a prism
is capable of a chromatic resolving power of about 50,000, so a
spectral line of wavelength 5,000 Å can be measured to an accuracy
of approximately 0.1 Å. To achieve much greater resolution would
demand excessively large prisms or a number of prisms in series to
give a large value of $b$, the length of the base, in Equation 5.3.

The use of a grating can lead to higher chromatic resolving power. 
Reference to Equation 5.5 shows that a large number N of rulings 
is required in the width of the grating encountered by the wave- 
front in the incident collimated light, and/or the spectrum must be 
recorded in a higher order p. Unless the grating were very large 
(demanding correspondingly a very wide uniform beam of incident 
collimated light) which would be very difficult and expensive to 
achieve, about the maximum possible value of N is 100,000. 
The limitation is determined by the number of rulings per centi- 
metre possible; the ruling is done by a diamond cutter attached to 
a dividing engine; the very small point on the diamond required 
to achieve an excessively fine ruling would wear unduly. In the 
second order (p = 2), a chromatic resolving power of about 
200,000 is possible with a ruled grating. 

In the visible spectrum, therefore, wavelength determination to
0.1 Å is possible with a prism spectrometer and possibly 0.025 Å
with a large grating spectrometer. In high-resolution work aimed
at determining the hyperfine structure of spectral lines and the
measurement of line breadth, considerably greater accuracy is
required.
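The figures quoted for the prism and the ruled grating follow directly from the definition of chromatic resolving power as the wavelength divided by the smallest resolvable wavelength interval. In this sketch the grating resolving power is taken to be the product of the order p and the number of rulings N, which is how Equation 5.5 is used in the text; the function names are illustrative assumptions.

```python
def grating_resolving_power(order, rulings):
    """Chromatic resolving power of a grating, taken as R = p * N."""
    return order * rulings

def smallest_resolvable_interval(wavelength_angstrom, resolving_power):
    """Smallest resolvable wavelength interval, lambda / R, in angstroms."""
    return wavelength_angstrom / resolving_power

# Prism instrument with R of about 50,000 at 5,000 angstroms.
print(smallest_resolvable_interval(5000.0, 50000))

# Ruled grating with N = 100,000 used in the second order.
r_grating = grating_resolving_power(2, 100000)
print(r_grating, smallest_resolvable_interval(5000.0, r_grating))
```

The two intervals come out at about 0.1 Å and 0.025 Å respectively, matching the limits stated above.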

For this very high resolution, the instruments available are the
Fabry-Perot interferometer (Section 4.2), the Lummer-Gehrcke
plate (Section 4.8) and the echelon grating. The Fabry-Perot
interferometer is usually placed between the collimator and the
prism of a constant deviation spectrograph. Without the Fabry-Perot
interferometer, the spectrograph will produce lines in the
focal plane of its camera lens if the source gives an atomic spectrum.
For two wavelengths $\lambda$ and $\lambda + \Delta\lambda$ in the radiation from the source
which are so close together that $\Delta\lambda$ is not resolvable by the prism,
the two lines will appear as one. On insertion of the Fabry-Perot
interferometer, very narrow bright rings alternating with dark ones
result. Though the spectral lines appear as one, the ring systems
due to each are clearly distinguishable. Measurement of their diameters
now enables $\Delta\lambda$ to be determined. Provided that the line wavelengths
are initially known to the nearest 0.1 Å, the introduction of the
Fabry-Perot interferometer enables the wavelength to be found to
at least three decimal places. The resolving power now obtainable
depends on the length of the etalon and the reflection coefficient
of its surfaces (Section 4.2).

Usually, the examination of the fine structure of a spectral line 
by the use of a Fabry-Perot interferometer necessitates isolation of 
the line by a prism or grating to avoid confusion. In some instances, 
however, a variable-length interferometer can be used to examine 
fine structure precisely without the use of a prism spectrometer, 
provided that the source of radiation is nearly monochromatic. 
When the interferometer separation is altered, the ring systems due 
to each component wavelength move relatively to one another. 
For example, measurement of the distance between successive 
coincidences of the ring systems due to each component of the 
yellow sodium doublet allows their difference in wavelength of 
5.97 Å to be readily determined (Section 4.4). 

The Lummer-Gehrcke plate (Section 4.8) is an alternative to the 
Fabry-Perot interferometer. It is used between the collimator and 
prism of a spectrometer. As described in Section 4.8, the Lummer- 
Gehrcke plate is usually 130 mm long, 15 mm wide and has a 
thickness of less than 1 cm. Its resolving power is equal approxi- 
mately to the length of the plate in wavelengths. It produces sharp 
bright line interference fringes on a dark background, whereas the 
Fabry-Perot interferometer produces bright circular fringes. 

The development of the Fabry-Perot interferometer has ren- 
dered obsolete the Lummer-Gehrcke plate, which is more difficult 
to make. 

Another instrument of very high resolving power is the echelon
grating, introduced by Michelson in 1898. The simple but ingenious
idea put forward relates, in effect, to a consideration of Equation
5.5 for the chromatic resolving power $R$ of a diffraction grating:

$$R = \frac{\lambda}{\delta\lambda} = pN$$

Whereas $N$ cannot readily be made to exceed 100,000 and $p$ is
limited to three or four with plane gratings, Michelson's suggestion
was to make $N$ equal to 20-40 but increase $p$ to several thousand.
To achieve this, he used a pile of parallel plane plates staggered
to simulate the steps of a staircase (the word echelon is the French
for step). With 40 plates (i.e. $N = 40$) and a step of 1 cm height,
the order of interference $p$ could be made 40,000 at a wavelength
of 5,000 Å, giving a resolving power of $40\times40{,}000$ (i.e. $1.6\times10^{6}$).
The echelon grating can be made as a transmission or a reflection
type. The former is very difficult to make as it demands not only
optically flat and plane-parallel blocks of glass but also the use of
glass of exceptional homogeneity. As it cannot achieve as high a
resolving power as the Fabry-Perot interferometer, the transmission
type is obsolete. The reflection echelon grating forms, however,
an important very high resolution instrument for work over a
wide range of wavelengths from the ultra-violet, through the visible,
into the infra-red. Though it is difficult to construct, homogeneity
in the refractive index of the glass is no longer necessary. Also,
being a reflection device, loss due to absorption in the ultra-violet
and the infra-red is of no concern as it would be in, for example, a
Fabry-Perot etalon constructed with quartz plates.

5.24 The Reflection Echelon Grating 

An echelon grating is constructed from a number N of plane-parallel 
plates which each has the same thickness t. These plates have to be 
very accurately plane, so that they can be wrung together in optical 
contact to form a pile with an off-set step, the height of each step 
being $s$. The optical working demanded is of a very high precision
to ensure that the plates will go into optical contact and that their
thicknesses $t$ are equal to within $0.05\lambda$, where $\lambda$ is a mean
wavelength of about 5,000 Å.

For a transmission echelon, it is essential that all the plates be 
cut from the same extremely homogeneous piece of optical glass. 
For the reflection echelon homogeneity in refractive index is un- 
important. Fused silica is usually employed for the reflection type, 
as it has a low thermal expansion coupled with a relatively high 
thermal conductivity compared with glass; these factors assist 
towards ensuring a high standard of optical polishing. 



Fig. 5.19. The reflection echelon grating (labels: source; aluminised
surface; to pump to vary pressure in container)



The pile of plates in a reflection echelon is supported in a mount 
within a container furnished with a window; the container can 
usually be connected to a vacuum pump so that the pressure 
(and hence refractive index) of the air surrounding the echelon 
can be reduced at will. After the plates have been mounted but 
before insertion in the container, the front-surface of the pile of 
step formation is aluminised. 

The reflection echelon is frequently set up in a Littrow type of
arrangement. Light from the illuminated slit $S_1$ [Fig. 5.19(a)]
is directed via the right-angled prism Q (or alternatively a mirror)
on to the lens L. With $S_1$ in the focal plane of L, collimated light
is incident on the aluminised front-surface of the echelon normal
to the front-surface planes of the plates. The reflected and diffracted
light from the echelon is focused by the lens L on to the slit $S_2$ of a
spectrograph.

Each 'mirror' of step width $s$ acts like the reflecting ruling of a
plane diffraction grating, and there are $N$ such steps. In Fig. 5.19(b),
consider neighbouring parallel rays 1 and 2 in the incident beam,
which give rise to parallel rays 1R and 2R in the reflected light
leaving at an angle of diffraction $\theta$ to the normal. Ray 2 has to
travel a distance $t$ greater than ray 1 to reach the echelon surface.
Ray 2R has to travel a distance $t\cos\theta$ to reach the wavefront
XY (drawn perpendicular to rays 1R and 2R through the corner of
a step). Ray 1R has to travel a distance $s\sin\theta$ from the reflecting
surface to reach XY. The path difference between rays 1R and 2R
is therefore

$$t + t\cos\theta - s\sin\theta$$

For light of wavelength $\lambda$ diffracted through an angle $\theta$ to give a
principal maximum in the focal plane of the lens L, this path
difference must be an integral number $p$ of wavelengths. Therefore

$$t(1 + \cos\theta) - s\sin\theta = p\lambda$$

where $p$ is an integer. As $\theta$ is a small angle,

$$2t - s\theta = p\lambda \tag{5.9}$$

Taking into account that the refractive index $n$ of the air is not
exactly unity and can be varied by varying the pressure, Equation
5.9 becomes

$$(2t - s\theta)\,n = p\lambda \tag{5.10}$$

With a thickness $t$ of 1.0 cm, for light of wavelength 5,000 Å
($5\times10^{-5}$ cm) and order of interference $p$ and for small angles $\theta$
(i.e. the diffracted light being very near the normal), it is seen from
Equation 5.9 that

$$p = \frac{2t}{\lambda} = \frac{2}{5\times10^{-5}} = 40{,}000$$

If there are 30 plates in the pile, $N = 30$, corresponding to 30 steps
or 'rulings' in the grating. The resolving power $R$ will therefore be
(Equation 5.5)

$$R = \frac{\lambda}{\delta\lambda} = pN = 4\times10^{4}\times30 = 1.2\times10^{6}$$
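The arithmetic of this example can be sketched as follows. The order is taken from Equation 5.9 with the angle of diffraction set to zero, and the resolving power as the product of the order and the number of plates; the function names are illustrative assumptions rather than anything defined in the text.

```python
def echelon_order(thickness_cm, wavelength_cm):
    """Order of interference near the normal, p = 2t / lambda."""
    return 2.0 * thickness_cm / wavelength_cm

def echelon_resolving_power(thickness_cm, wavelength_cm, plates):
    """Resolving power taken as R = p * N for N plates."""
    return echelon_order(thickness_cm, wavelength_cm) * plates

# Plates 1.0 cm thick, wavelength 5,000 angstroms = 5e-5 cm.
print(echelon_order(1.0, 5e-5))                 # close to 40,000
print(echelon_resolving_power(1.0, 5e-5, 30))   # close to 1.2e6
print(echelon_resolving_power(1.0, 5e-5, 40))   # close to 1.6e6
```

Because p depends only on the plate thickness and the wavelength, adding plates raises the resolving power in direct proportion without changing the order.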

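Assuming $n = 1$ and ignoring the dispersion of air, the angular
dispersion $d\theta/d\lambda$ is obtained from Equation 5.9, which may be
rearranged to give

$$\theta = \frac{2t}{s} - \frac{p\lambda}{s} \tag{5.11}$$

Therefore

$$\frac{d\theta}{d\lambda} = -\frac{p}{s}$$

As $p = 2t/\lambda$ for small angles $\theta$, therefore

$$\frac{d\theta}{d\lambda} = -\frac{2t}{\lambda s} \tag{5.12}$$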






For $t = 1.0$ cm and $s = 0.6$ cm, the magnitude of the angular dispersion
$d\theta/d\lambda$ at and near the normal is therefore
$2/(5\times10^{-5}\times0.6) = 66{,}667$ rad cm$^{-1}$ for light of wavelength 5,000 Å.

The angular separation between neighbouring orders is given
by evaluating $d\theta/dp$ and putting $dp = 1$. From Equation 5.11

$$\frac{d\theta}{dp} = -\frac{\lambda}{s} \tag{5.13}$$

The magnitude of the angular separation between neighbouring orders
is therefore $\lambda/s$, which is very small. The echelon can thus only
be used to examine a narrow range of wavelengths at any one
time. This is ideal for the study of the hyperfine structure of lines
or the splitting of lines in the Zeeman effect. The source of light
to be examined (for example, that illuminating slit $S_1$ in Fig.
5.19(a)) is therefore best substantially monochromatic.

Fig. 5.20. The intensities of neighbouring orders observed with a
reflection echelon grating (label: distribution of light in diffraction
pattern from slit of width s)

As neighbouring orders have an angular separation of only $\lambda/s$,
which is half the width of the central maximum of the diffraction
pattern due to a face of width $s$, only two orders are observed with
any intensity (Fig. 5.20).
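The quantities discussed here can be evaluated together for the worked values t = 1.0 cm, s = 0.6 cm and a wavelength of 5,000 Å. The sketch below uses the magnitudes of Equations 5.12 and 5.13. The third function is a derived quantity not stated in the text: dividing the angular separation of orders by the angular dispersion gives the wavelength interval that fills the gap between neighbouring orders, λ²/(2t), and it is included only to put a number on how narrow a range the echelon can examine. All function names are illustrative assumptions.

```python
def angular_dispersion(t_cm, wavelength_cm, s_cm):
    """Magnitude of d(theta)/d(lambda) = 2t / (lambda * s), in rad per cm."""
    return 2.0 * t_cm / (wavelength_cm * s_cm)

def order_separation(wavelength_cm, s_cm):
    """Magnitude of the angle between neighbouring orders, lambda / s (rad)."""
    return wavelength_cm / s_cm

def wavelength_between_orders(t_cm, wavelength_cm):
    """Derived: order separation / angular dispersion = lambda^2 / (2t), in cm."""
    return wavelength_cm ** 2 / (2.0 * t_cm)

lam = 5e-5  # 5,000 angstroms in centimetres
print(angular_dispersion(1.0, lam, 0.6))          # about 66,667 rad per cm
print(order_separation(lam, 0.6))                 # about 8.3e-5 rad
print(wavelength_between_orders(1.0, lam) * 1e8)  # about 0.125 angstrom
```

On these assumptions the usable wavelength range is only about an eighth of an angstrom, consistent with the statement that the source should be substantially monochromatic.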

The echelon grating can be regarded as one with a very effective 
'blaze', in that almost all the light giving rise to diffraction maxima 
is confined to small angles to the normal and within two orders. 
Harrison introduced in 1949 the echelle grating (Fig. 5.21) which 
is, in effect, a blazed grating half-way in operation between the 
ruled plane grating and the echelon. The echelle grating has wide 
shallow grooves designed to reflect light incident at an angle 
usually exceeding 45°, but normal to the shallow side of the step. 



Fig. 5.21. The echelle grating in a Littrow arrangement (the grating
has 40-400 steps per centimetre, and step depths in the range
0.06-0.25 mm) (labels: source; prism or mirror; echelle grating)



5.25 Standards of Wavelength 

The absolute standard is the metre, defined as 1,650,763.73
wavelengths of a particular line in the orange of the spectrum of a
krypton 86 discharge lamp operated at low temperature (Section
4.6). The primary standard of wavelength is therefore the krypton
86 orange line, determined in relation to the metre to an accuracy
of one part in $10^{8}$.
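The wavelength implied by this definition is obtained by dividing one metre by the stated number of wavelengths. A minimal sketch, in which the constant and function names are illustrative assumptions:

```python
WAVELENGTHS_PER_METRE = 1_650_763.73  # from the definition of the metre above

def krypton_wavelength_angstrom():
    """Wavelength of the krypton 86 orange line, in angstroms (1 m = 1e10 angstroms)."""
    return 1.0e10 / WAVELENGTHS_PER_METRE

print(round(krypton_wavelength_angstrom(), 4))
```

The result is close to 6,057.8 Å, which places the line in the orange part of the spectrum as stated.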

Other primary standards of wavelength used prior to that of the
krypton 86 radiation are the cadmium red line and the mercury 198
green line of wavelength 5460.740 Å.

Precision measurements of other wavelengths are made by com- 
parison with a primary standard, usually by means of a grating 
spectrometer and a Fabry-Perot etalon spectrometer. From such 
comparisons, useful secondary standards of wavelength have been 
established. In particular, the spectra of iron, neon and krypton 
are valuable. 

Thousands of lines of wavelengths between 2,101 Å and 10,216 Å 
in the arc spectrum of iron have been measured. With the operat- 
ing conditions of the iron arc carefully specified, 306 secondary 
standards of wavelength between 2,447 A and 6,677 A have been 
adopted. 

The neon spectrum is especially rich in red lines, several of which 
are used as secondary standards. The wavelengths in the spectrum 
of natural krypton are useful secondary standards as they agree 
to within one part in 10« or better with those from the isotopes 
krypton 84 and krypton 86, the last of which is the fundamental 
standard. 




5.26 The Calibration of a Spectrometer 

In the ultra-violet region between 4,000 A and 2,000 A, a number of 
lines of known wavelengths are employed for calibration of a 
spectrometer or spectrograph. In Angstrom units to four signif- 
icant figures, where the element giving the spectrum is noted in 
brackets, these lines are 3,902 (molybdenum), 3,719 (iron), 3,515 
(nickel), 3,453 (cobalt), 3,067 (bismuth), 2,881 (silicon), 2,496 
(boron) and 2,348 (beryllium). 

For calibration in the visible region, the usual choice is the
cadmium lamp spectrum in which convenient lines occur at 6,438 Å,
5,086 Å, 4,800 Å, 4,678 Å and 4,413 Å.

If the spectrum to be determined is photographed beside the
cadmium spectrum on the same photographic plate, a travelling
microscope or comparator is used to measure the distance
$x'$ of the line of unknown wavelength $\lambda'$ from, say, the cadmium
line at 6,438 Å. The Hartmann dispersion equation is

$$\lambda = \lambda_0 + \frac{B}{x - A} \tag{5.14}$$

where $\lambda_0$ is a constant wavelength, and $A$ and $B$ are constants for
a given spectrograph. These constants $A$, $B$ and $\lambda_0$ are found by
determining $x$ for three known lines in the cadmium spectrum,
corresponding to three known values of $\lambda$. For the line of wavelength
$\lambda'$, $x = x'$ and use of Equation 5.14 in which $\lambda_0$, $A$ and $B$ are
now known enables $\lambda'$ to be found.
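
The determination of the three constants can be sketched numerically. The example below assumes the Hartmann relation in the form $\lambda = \lambda_0 + B/(x - A)$; multiplying out gives $\lambda x = \lambda_0 x + A\lambda + C$ with $C = B - \lambda_0 A$, which is linear in $\lambda_0$, $A$ and $C$, so three known lines give three simultaneous equations. The plate positions used here are synthetic, generated from hypothetical constants purely to show that the procedure recovers a fourth cadmium line; they are not measured data.

```python
def fit_hartmann(lines):
    """Return (lam0, A, B) from three (x, wavelength) pairs.

    Uses lam*x = lam0*x + A*lam + C with C = B - lam0*A,
    which is linear in the unknowns lam0, A and C.
    """
    # Augmented 3x4 matrix [x, lam, 1 | lam*x], one row per known line.
    m = [[x, lam, 1.0, lam * x] for x, lam in lines]
    for col in range(3):  # Gauss-Jordan elimination with partial pivoting
        pivot = max(range(col, 3), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        p = m[col][col]
        m[col] = [v / p for v in m[col]]
        for r in range(3):
            if r != col:
                f = m[r][col]
                m[r] = [a - f * b for a, b in zip(m[r], m[col])]
    lam0, A, C = m[0][3], m[1][3], m[2][3]
    return lam0, A, C + lam0 * A


def wavelength(x, lam0, A, B):
    """Hartmann relation in the assumed form lam = lam0 + B/(x - A)."""
    return lam0 + B / (x - A)


def position(lam, lam0, A, B):
    """Inverse relation, used here only to manufacture synthetic positions."""
    return A + B / (lam - lam0)


# Hypothetical spectrograph constants (angstrom, mm, angstrom*mm).
TRUE = (2500.0, 10.0, 120000.0)

# Three known cadmium lines with their synthetic plate positions.
known = [(position(lam, *TRUE), lam) for lam in (6438.0, 5086.0, 4800.0)]
lam0, A, B = fit_hartmann(known)

# A fourth line treated as unknown: recover its wavelength from x alone.
x_unknown = position(4678.0, *TRUE)
print(round(wavelength(x_unknown, lam0, A, B), 3))
```

In practice the three calibration lines are best chosen to span the region containing the unknown line, since the Hartmann relation is only an approximation to the true dispersion.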

In the infra-red region, a convenient practice is to determine 
the irregularities in an intensity-wavelength graph for the energy 
received by a spectrometer (with thermopile recording, for exam- 
ple) from a continuous source of radiation. These irregularities 
are caused by carbon dioxide and water vapour, which absorb 
specific wavelengths. 

5.27 Calibration of a Spectroscope by the Edser-Butler Method 

A convenient simple method for calibrating a spectroscope in
the visible region is due to Edser and Butler. It is capable of an
accuracy of 3-5 Å. Though the procedure forms an interesting
students' laboratory experiment, it is only seldom used in spectroscopy.

A slit $S_1$ is illuminated by white light, and an image of it is
formed by a lens L on the slit $S_2$ of the spectroscope [Fig. 5.22(a)].
A continuous spectrum is observed in the telescope of the spectroscope.
The Edser-Butler arrangement consists of two plane glass
plates placed one on top of the other but separated by a small 
distance d by means of suitable spacers. A suitable device for a 
students' experiment is to use two small sheets of plate glass 
separated by strips of paper at their edges and held in a simple 
clamp or holder. In order to render these plates parallel, a distant 
small illuminated aperture is viewed through the plates which 
are then pressed together until the aperture and its images coincide. 







[Fig. 5.22. Use of the Edser-Butler method for calibrating a spectroscope.
(a) Source of white light, slit S1 (perpendicular to plane of diagram),
Edser-Butler plates, slit S2 (in plane of diagram).
(b) Light from white source via Edser-Butler plates, right-angled prism,
light from standard discharge tube.]

The preferable commercial instrument consists of a thin air film 
between two parallel semi-aluminised plates in a holder with three 
adjusting screws: in effect, an etalon with a very small length d. 
The Edser-Butler plates are usually placed near the spectroscope 
slit within the beam of incident white light. The angle of incidence 
equals the angle of refraction R within the air film. With the plates 
normal to the incident beam, the angle R is zero. From the study 
of the interference of light in plane parallel-sided plates (Section 
9.1 of Volume 1) in the transmitted light, for a maximum

$$2nd\cos R = p\lambda \tag{5.15}$$

and for a minimum

$$2nd\cos R = \left(p + \tfrac{1}{2}\right)\lambda \tag{5.16}$$




where $p$ is an integer and $n$ is the refractive index of the parallel-sided
film between the plates. The refractive index $n = 1$, as the film is
air.

There is a continuous range of values of $\lambda$ in the radiation from
the white light source. At any wavelength in this continuum at
which Equation 5.16 is satisfied, a dark band will appear across
the spectrum in the field of view of the telescope of the spectroscope.
On the other hand, when Equation 5.15 is satisfied there
will be a maximum, giving a bright band. The continuous spectrum
observed is therefore crossed by a series of parallel dark bands
alternating with bright ones perpendicular to the direction in which
the spectrum is traversed.

Let $\lambda_r$ be a wavelength in the red end of the spectrum at which
a bright band occurs. There will be a series of wavelengths $\lambda_1$, $\lambda_2$,
$\lambda_3$, ..., $\lambda_k$ (all shorter than $\lambda_r$), at which bands numbered
successively 1, 2, 3, ..., k will occur. From Equation 5.15, putting $n = 1$
and $R = 0$ (plates normal to the incident beam),

$$2d = p_r\lambda_r$$

where $p_r$ is the order of interference producing the bright band at
wavelength $\lambda_r$, and

$$2d = p_k\lambda_k$$

where $p_k = p_r + k$, and $\lambda_k$ is the wavelength corresponding to the
kth band. Therefore

$$2d = p_r\lambda_r = p_k\lambda_k = (p_r + k)\lambda_k$$

Therefore

$$\lambda_r = \lambda_k + \frac{k\lambda_k}{p_r}$$

But $p_r = 2d/\lambda_r$. Therefore

$$\lambda_r = \lambda_k + \frac{k\lambda_r\lambda_k}{2d}$$

Thus

$$\frac{1}{\lambda_k} = \frac{1}{\lambda_r} + \frac{k}{2d} \tag{5.17}$$

or

$$\nu_k = \nu_r + \frac{k}{2d} \tag{5.18}$$

where $\nu = 1/\lambda$ is the wave number. As $2d$ is a constant and $k$ is an
integer, it is seen that successive bright bands (and correspondingly
intervening dark bands) cross the spectrum at equal intervals of
wave number $\nu$.




It is necessary to select two known wavelengths: one $\lambda_r$ (preferably
in or near the red) and the other $\lambda_k = \lambda_b$ (near or in the blue)
in order to determine the wavelengths $\lambda_k$ at which intermediate
bright bands occur, where k = 1, 2, 3, .... The procedure is to
observe two spectra at the same time. One is the continuous spectrum
interrupted by bands due to the Edser-Butler plates and
brought about by white light illuminating the top half of the
spectroscope slit. The other is a line spectrum (from a discharge
tube) containing lines of two known wavelengths $\lambda_r$ and $\lambda_b$ of the
light which illuminates the bottom half of the slit. Fig. 5.22(b)
shows a typical method of achieving this dual observation. The
lines of known wavelengths $\lambda_r$ and $\lambda_b$ will not, in general, coincide
exactly with positions of maxima in the Edser-Butler fringes, so it
is necessary to estimate a fraction of a fringe in estimating the
value of $k$ in Equation 5.17 which corresponds to where $\lambda_k = \lambda_r$
or $\lambda_b$.

Given specific values of $\lambda_r$ and $\lambda_b$ separated by a counted number
of fringes $k$, substitution in Equation 5.17 gives $d$, the thickness
of the air-film between the Edser-Butler plates. With the collimator
and prism positions fixed, the telescope can now be rotated from
the position where light of wavelength $\lambda_r$ is focused on the eyepiece
cross-wires to the successive bright bands corresponding to k = 1,
2, 3, .... At each position, the angle of setting of the telescope is
noted against a divided circle and the corresponding wavelengths
$\lambda_k = \lambda_1, \lambda_2, \lambda_3, \ldots$ are calculated by putting k = 1, 2, 3, ...
successively in Equation 5.17. Alternatively, the collimator and
telescope positions can be kept fixed, and the angle of setting of
the prism plotted against wavelength.

The method also lends itself readily to a linear plot of various
values of $\nu_k$, the wave number, against $k$, as is seen from Equation
5.18.
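
The arithmetic of the method can be sketched as follows, taking the relation between band number and wavelength as $1/\lambda_k = 1/\lambda_r + k/2d$. The two reference wavelengths are cadmium lines quoted in Section 5.26; the fringe count between them is a hypothetical value chosen for illustration, and for simplicity the red reference line is assumed to coincide with a bright band.

```python
def film_thickness(lam_r, lam_b, k_b):
    """Air-film thickness d from two known lines k_b fringes apart.

    Rearranged from 1/lam_b = 1/lam_r + k_b/(2d); d is returned in
    the same units as the wavelengths.
    """
    return k_b / (2.0 * (1.0 / lam_b - 1.0 / lam_r))


def band_wavelength(lam_r, d, k):
    """Wavelength of the k-th bright band counted from lam_r."""
    return 1.0 / (1.0 / lam_r + k / (2.0 * d))


LAM_R = 6438.0  # cadmium red line, angstrom
LAM_B = 4800.0  # cadmium blue line, angstrom
K_B = 52.4      # hypothetical fringe count, including an estimated fraction

d = film_thickness(LAM_R, LAM_B, K_B)
print(f"film thickness d = {d * 1e-7:.4f} mm")  # 1 angstrom = 1e-7 mm

# Calibration table: band number, wavelength and wave number.
for k in range(0, 6):
    lam_k = band_wavelength(LAM_R, d, k)
    print(k, round(lam_k, 1), f"{1.0 / lam_k:.6e}")
```

The last column increases by the same amount, $1/2d$, from one band to the next, which is why a plot of wave number against band number is a straight line.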



5.28 Checking the Chromatic Resolving Power of a Spectrometer 

In the visible spectrum, a number of elements give spectra contain- 
ing convenient doublet lines useful in making a quick approximate 
check on the resolving power of a prism or small grating spectrom- 
eter. These doublets are given in Table 5.6. 

[Table 5.6: doublet lines for checking chromatic resolving power]

5.29 Spectrophotometry

A spectrophotometer is an instrument for measuring the distribution
of the energy of radiation through part of the spectrum.
This energy may be directly from a source of radiation, or from a
surface which reflects or scatters radiation directed on to it from
a primary source, or from a material (solid, liquid or gas) which
transmits radiation directed on to it. In general, a spectrophotometer
does not measure the energy in a given narrow region of
the spectrum in absolute units. Rather, the purpose is to compare
the output obtained when the radiation concerned is incident
directly on the spectrophotometer slit with the output obtained
when a reflecting surface or absorbing material is placed between
the radiation source and the slit.

The output is measured photographically, or by some kind of
phototransducer which converts the radiation usually into an
electrical signal, the commonest form of such transducer being a
photocell.

Usually, the radiation incident upon the slit of the spectrophotometer
is divided into two parts: one arriving directly or otherwise
from the luminous source, and the other after reflection from a
surface or transmission through a material. There are consequently
two simultaneous output signals from the spectrophotometer and
these are compared.

The spectrophotometer must be based on a dispersive system 
adapted to the spectral region concerned. Much work is done with 
prism and grating instruments in the visible spectrum, but instru- 
ments for operation in the infra-red and ultra-violet are made and 
continual development has been undertaken and is still in progress 
to enable the far infra-red and far ultra-violet regions to be quanti- 
tatively explored. 

In the visible region, the reflectance at various determined wave- 
length intervals in the spectrum (i.e. the spectral reflectance) from 
the surface of a sample of opaque material is frequently determined 
by comparing with a suitable spectrophotometer the amount of 
light reflected by the sample with that from a standard white surface 
which reflects all the light incident upon it. 

The most important application is absorption spectrophotom- 
etry. In general terms, the purpose is to measure the transmission 
of a transparent material for radiation in various selected regions 
of the spectrum. Quantitative work is then based on two laws:
Lambert's law and Beer's law. 

Lambert's law relates the transmitted intensity $I$ to the incident
intensity $I_0$ in terms of the length $l$ of the radiation path through
the absorber and the absorption coefficient $\mu$ of the material.
This law is

$$I = I_0 \exp(-\mu l) \tag{5.19}$$

Beer's law states that

$$I = I_0 \exp(-cl) \tag{5.20}$$

where $c$ is the concentration of an absorbing solution.




These two laws are usefully combined to give

$$I = I_0 \exp(-Kcl) \tag{5.21}$$

where $K$ is another constant which depends on $d$, the size of any
particles in suspension in a solution, and the wavelength $\lambda$ of the
radiation. Except when $d$ is very small, Equation 5.21 becomes

$$I = I_0 \exp(-K_1 cl) \tag{5.22}$$

or

$$K_1 cl = \log_e \frac{I_0}{I} \tag{5.23}$$

The term $\log_e(I_0/I)$ is known as the absorbance, and $K_1$ is the
molar absorptivity.

Automatic self-balancing spectrophotometers readily measure
the transmittance $I/I_0$, which is an inconvenient measurement for
subsequent use as it is not related linearly to concentration and the
length of the cell containing the solution. Modern instruments measure
the absorbance, which is related directly to $K_1$, $c$ and $l$.

Deviations can occur from the Lambert-Beer law when light is
scattered as well as absorbed by the material and also when fluorescence
is induced in the solution by the incident radiation. Deviations
can be experienced which depend on the concentration of the
solution: thus Beer's law may not be valid if partial or complete
dissociation of the solute occurs.
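
The practical advantage of absorbance over transmittance can be shown with a short numerical sketch. It takes the combined law in the form $I = I_0 \exp(-K_1 cl)$ and the absorbance as $\log_e(I_0/I)$; the values of $K_1$, $c$ and $l$ below are hypothetical, and the deviations just described are ignored.

```python
import math


def transmittance(k1, c, l):
    """I/I0 for molar absorptivity k1, concentration c and path length l."""
    return math.exp(-k1 * c * l)


def absorbance(i0, i):
    """Natural-log absorbance log_e(I0/I), as defined in the text."""
    return math.log(i0 / i)


def concentration(i0, i, k1, l):
    """Concentration recovered from a measured pair of intensities."""
    return absorbance(i0, i) / (k1 * l)


K1 = 250.0   # hypothetical molar absorptivity, litre per mole per cm
L_CELL = 1.0  # cell length in cm

for c in (0.001, 0.002, 0.004):  # hypothetical concentrations, mole per litre
    t = transmittance(K1, c, L_CELL)
    print(c, round(t, 4), round(absorbance(1.0, t), 4))
```

Doubling the concentration doubles the absorbance but squares the transmittance, which is why an instrument reading absorbance directly gives a scale linear in $c$ and $l$.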



5.30 The Use of Photographic Emulsions in Spectrography and 
Spectrophotometry 

A much used procedure is to record on a photographic plate in 
a spectrograph the spectrum of the radiation from a source. The 
optical density of the silver deposit in the subsequently processed 
emulsion is then recorded at various known wavelengths through- 
out the spectrum. This optical density is measured by a micro- 
photometer. 

This kind of technique can be used in two main ways. In the 
first, the spectrum of the radiation from a luminous source (which 
may, for example, be the continuous spectrum of the light from 
a tungsten lamp or the line spectrum from a discharge lamp or an 
arc) is recorded; on the same photographic plate is recorded (usu- 
ally simultaneously, by means of a beam-splitter device in front of 
the spectrograph slit) the spectrum resulting when this radiation is 
transmitted through an absorbing material. From the measurement
of the two sets of optical densities of the photographic deposit 
at various wavelengths, the transmittance and absorbance at these 
wavelengths of the absorbing material may be found. In the second 
method, typical practice is to photograph the arc or spark spectrum 
for a metallic alloy 1 having a known constituent (for example, 
a known percentage of aluminium in a steel). The arc or spark spectrum
of a second metallic alloy 2 in which the percentage of, say, alumi- 
nium is unknown is also photographed. By comparing the mea- 
sured optical densities at certain wavelengths in the two spectra, 
the percentage of aluminium in the second alloy can be found. 

To examine the validity of the use of photographic emulsions 
in spectrography and spectrophotometry, an outline of a theory of 
the photographic process due to Mott and Gurney in 1938 is given. 

Photographic emulsions, consisting essentially of silver halide 
in gelatine, are more sensitive to light if they contain sulphur com- 
pounds (thiocarbamides). Thus, the formation of silver sulphides 
in the silver bromide grains in the emulsion seems possible. Silver 
bromide has an ionic lattice structure. When exposed to light, mo- 
bile electrons are liberated from the bromine negative ions by the 
incident photons, the reaction being 

$$\mathrm{Br^-} + h\nu \rightarrow \mathrm{Br} + e^-$$

where v is the frequency of the incident radiation and h is Planck's 
constant. These electrons will be trapped near the sulphides. The 
silver sulphide spots are therefore negatively charged. 

Positive silver ions $\mathrm{Ag^+}$ (smaller than bromine ions) that have
been displaced by thermal processes are mobile and move towards 
the negatively charged sulphide spots. On arrival, collections of 
silver atoms are formed around the silver sulphide spots in the 
grains; these spots grow as the process proceeds. The reaction is 

$$\mathrm{Ag^+} + e^- \rightarrow \mathrm{Ag}$$

The latent image is thus formed. When the exposed photographic 
emulsion is placed in the chemical developer (a mild reducing 
agent), electrons are supplied to the silver nuclei. The spots become 
charged negatively, and so more positive silver ions are discharged 
from the grain. Development increases the free silver in the exposed 
grains by a factor of about $10^5$. The emulsion is fixed after development
and washing by immersion in 'hypo' (sodium thiosulphate),
which reacts with the residual silver bromide but affects only slight- 
ly the black silver in the exposed grains. The complex argentothio- 
sulphates formed are more or less unstable and are subsequently 
removed from the emulsion by washing, leaving behind the black 
silver deposits which are denser the greater the initial incident light 
exposure. 




With low incident light intensities, the modest supply of photons 
means that the rate of arrival of electrons will be small at the sensi- 
tive silver sulphide spots, and these spots will hardly change because 
they lose electrical charge by leakage. At the other extreme, with 
high light intensities, large numbers of electrons are produced by 
the plentiful photons, but each sulphide spot can take only a certain 
charge at a time because of its small capacitance. Between these two 
extremes, the influx of electrons takes place at a rate such that they 
are all captured in silver sulphide spots. The developed and fixed 
plate will then exhibit blackened silver deposits in which the degree 
of blackening is directly proportional to the exposure. 

The exposure is the product of illumination $E$ and the time $t$.
The emulsion blackening is expressed by its optical density $D$:

$$D = \log_{10}\frac{I_0}{I}$$

where $I_0$ is the incident light intensity, $I$ is the transmitted light
intensity and $I_0/I$ = opacity $O$ (the reciprocal of the transmittance).
If a film transmits half the light intensity incident upon it, the opacity
$O = 2$ and $D = 0.3$. Similarly, films which transmit 0.1, 0.01
and 0.001 of the incident light have opacities of 10, 100 and 1,000
and optical densities of 1.0, 2.0 and 3.0 respectively.
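
The figures quoted above can be checked with a few lines of arithmetic, taking the optical density as $D = \log_{10}(I_0/I)$ and the opacity as $I_0/I$. The function names are illustrative only.

```python
import math


def opacity(transmitted_fraction):
    """Opacity O = I0/I, the reciprocal of the transmittance."""
    return 1.0 / transmitted_fraction


def optical_density(transmitted_fraction):
    """Optical density D = log10(I0/I) = -log10(transmittance)."""
    return -math.log10(transmitted_fraction)


for fraction in (0.5, 0.1, 0.01, 0.001):
    print(fraction,
          round(opacity(fraction), 1),
          round(optical_density(fraction), 2))
```

Because the density is logarithmic, the densities of successive layers add while their transmittances multiply.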

The optical densities are measured photometrically (the instru- 
ment used in spectroscopy is the microphotometer). If these densi- 
ties are plotted against the logarithm of the exposures (Et, where 
$E$ is the illumination and $t$ the time) that have produced them, the
characteristic curve (blackening or sensitometric curve) for the 
photographic emulsion concerned is obtained. Such curves have 
three distinct regions (Fig. 5.23): the curved portion with under- 
exposure, the linear region with correct exposure, and the markedly 
curved region with over-exposure leading to reversal or the so- 
larisation region. 

In the central linear region over which the exposure is correct,
the equation of the characteristic is easily seen to be

$$D = \gamma(\log Et - \log i) \tag{5.24}$$

where $\gamma = \tan\alpha$ is the gradient of this linear section, $\alpha$ being the
angle it makes with the log $Et$ axis. The term $\gamma$ is related to contrast,
but $\gamma$ is not the only deciding factor. As shown in Fig. 5.23, the inertia
$i$ is the intercept on the log $Et$ axis, decided by the tangent to the
straight section of the characteristic. For fast photographic emulsions,
which are very sensitive to light, $\gamma$ is about 1.4; for slow insensitive
emulsions, $\gamma$ is about 3.2.
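
A minimal sketch of how $\gamma$ and the inertia $i$ might be extracted from two measurements on the straight portion of the characteristic is given below, taking the linear region as $D = \gamma(\log_{10} Et - \log_{10} i)$. The exposures and densities are hypothetical, and the relation does not hold in the under-exposed or over-exposed regions.

```python
import math


def density(exposure, gamma, inertia):
    """D = gamma*(log10(Et) - log10(i)); valid on the linear region only."""
    return gamma * (math.log10(exposure) - math.log10(inertia))


def gamma_and_inertia(point1, point2):
    """Gradient and intercept from two (exposure, density) pairs."""
    (e1, d1), (e2, d2) = point1, point2
    gamma = (d2 - d1) / (math.log10(e2) - math.log10(e1))
    log_i = math.log10(e1) - d1 / gamma  # where the line meets D = 0
    return gamma, 10.0 ** log_i


# Hypothetical emulsion: gamma = 1.4, inertia = 0.02 (arbitrary exposure units).
p1 = (0.1, density(0.1, 1.4, 0.02))
p2 = (1.0, density(1.0, 1.4, 0.02))
g, i = gamma_and_inertia(p1, p2)
print(round(g, 3), round(i, 4))
```

With more than two points a least-squares straight line through the linear region would normally be preferred; two points are used here only to keep the arithmetic visible.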

[Fig. 5.23. Characteristic curve for a photographic emulsion (negative
material): optical density D against log Et, showing the fogging level,
the under-exposure region and the inertia]

The characteristic curve is determined in an internationally adopted
manner. The light source is a tungsten filament lamp operated
at a colour temperature of 2,360°K and screened with a filter to give
the light the quality of noon sunlight (colour temperature, 5,400°K).
(The colour temperature of an incandescent source is the temperature
at which a black-body radiator would have to operate in
order to match visually the light from the source.) A set of graded
exposures is produced by using various times $t$ with constant
illumination $E$, and the optical density $D$ is determined photometrically.

The constants $\gamma$ and $i$ will be specific values for light specified
in this way. However, both $\gamma$ and $i$ will vary considerably with
wavelength $\lambda$. The value of $\gamma$ for the emulsion will depend on the
nature of the developer and on the time and temperature with
a given developer. As development time increases, $\gamma$ increases to
a limited maximum value $\gamma_{max}$. The inertia $i$ remains unchanged
when bromide-free developers are used, but varies with different
developers.

Light absorption takes place over the whole silver bromide grain. 
A grain must absorb at least 100 photons to become developable. 
Photographic materials differ in graininess: this increases with fast 
emulsions, which are very sensitive to light. The resolving power of 
an emulsion is measured by the number of lines per millimetre which 
can be distinguished in the processed negative. Values range from 
40 lines to 50 lines per millimetre for fast emulsions, and reach sev- 
eral hundred lines per millimetre for slow emulsions. 

For some photochemical reactions, the effect depends only on the
exposure $Et$; i.e. if $E$ is multiplied by $n$ and $t$ is divided by $n$,
the exposure $Et$, and hence the effect, is the same. This is known as
Bunsen and Roscoe's reciprocity law, enunciated in 1862. For
photographic emulsions, the law is $Et^p$ = constant, where $p$ is called
the Schwarzschild exponent. For small values of $E$, $p = 0.8$, but varies
with the emulsion. Halving the illumination $E$ and doubling the
exposure time $t$, therefore, does not result in the same optical density
of the silver deposit in the processed emulsion. This is known as
reciprocity failure. The exponent $p$ diminishes the effect of time in
exposure; with low values of $E$, correspondingly longer times $t$ are
necessary. Thus, if a photographic record cannot be obtained in 2 h,
little is gained by doubling the exposure time; an increase by a factor of
about 10 is needed. For much ordinary photography, however, the
reciprocity law can be considered to be true; it is only with very
high levels and very low levels of illumination of the photographic
emulsion that reciprocity failure presents a serious source of difficulty.
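
The size of the effect can be estimated from the relation $Et^p$ = constant. The sketch below uses $p = 0.8$, the value quoted for small $E$, and compares it with strict reciprocity ($p = 1$); the function names are illustrative only.

```python
def equivalent_time(t1, e1, e2, p):
    """Time t2 at illumination e2 giving the same E*t**p as (e1, t1).

    From e1*t1**p = e2*t2**p it follows that t2 = t1*(e1/e2)**(1/p).
    """
    return t1 * (e1 / e2) ** (1.0 / p)


def effective_gain(time_factor, p):
    """Factor by which E*t**p rises when t is multiplied by time_factor."""
    return time_factor ** p


P = 0.8  # Schwarzschild exponent quoted in the text for small E

# Halving the illumination: time needed relative to the original exposure.
print(round(equivalent_time(1.0, 1.0, 0.5, 1.0), 3))  # strict reciprocity
print(round(equivalent_time(1.0, 1.0, 0.5, P), 3))    # with reciprocity failure

# Gain in effective exposure from doubling, and from a tenfold increase in, t.
print(round(effective_gain(2.0, P), 3), round(effective_gain(10.0, P), 3))
```

With $p = 0.8$, halving the illumination calls for rather more than double the time, and doubling the time raises the effective exposure by less than a factor of two.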

Other problems in photographic recording are: shrinkage of 
film on development (plates are therefore preferred); turbidity 
whereby an emulsion on development displays a variation in optical 
transmission over non-exposed portions (this appears as chemical 
fogging); agitation of the developer is important in processing to 
avoid local variations across the emulsion in developer concentra- 
tion. 

This brief account of the photographic process clearly stresses 
the need for very accurate control in the use of photographic plates 
in spectrophotometry. Even with the best procedure, it is not easy 
to ensure accuracy to better than ±5%. The factors which demand 
careful regulation are:

1. Selection of photographic plates of appropriate speed, $\gamma$ and
spectral sensitivity. In any comparison between recorded 
spectra, it is preferable to record the spectra concerned along- 
side one another on the same plate because, in general, there 
will be too great a variation of characteristics between one 
plate and another, even amongst those from the same box of 
plates. 

2. The exposure must be arranged to ensure as far as practicable 
that the linear portion only of the characteristic curve for the 
emulsion is utilised. 

3. The development must be carefully standardised as regards
choice of developer, temperature at which development is undertaken
and time of development. Agitation should be practised
in a regulated manner.

4. The subsequent washing, fixing and final washing procedures 
must be regulated, but the demands on control are not so severe 
as in development. 

With such careful attention to photographic procedures, it is 
clearly possible to determine the absorption spectrum of a material
or to compare the emission spectra from various luminous sources. 
The photographic method in spectrophotometry is chiefly of value 
in the visible region of the spectrum and the near ultra-violet. 

5.31 The Hilger-Nutting Spectrophotometer 

The Hilger-Nutting spectrophotometer (Fig. 5.24) is used chiefly to 
measure the absorption of visible light of various wavelengths in 
a liquid. It employs polarisation methods to compare the intensities 
of two beams of light. It is used in conjunction with a spectrometer, 
the combination forming a spectrophotometer. 




[Fig. 5.24. Optical system of a Hilger-Nutting photometer (beams reaching
R from A and B are polarised in planes at right angles to one another by the
two Nicol prisms $N_1$ and $N_2$); the emerging light passes to the slit of
the spectrometer]



Light from the source S is deviated into two parallel beams 1 and
2 by the achromatic prisms $P_1$ and $P_2$. Beam 1 traverses absorption
cell A, whereas beam 2 traverses an identical cell B. A and B are
usually glass cylinders terminated by plane-parallel end windows
and can be filled with the fluid to be examined. Often cell A is
empty (contains air) and cell B is filled with the liquid of which an
absorption study is required.

Beam 1 traverses Nicol prism $N_1$, whilst beam 2 traverses Nicol
prism $N_2$; these Nicols are set so that light in beam 1 emerging from
$N_1$ is polarised in a plane perpendicular to that of beam 2 emerging
from $N_2$. Subsequently in the optical paths, the rhomboidal prism
R brings the beams 1 and 2 close together but so that they can still
be seen separately. These adjacent beams are collimated by lens
$L_1$ before they traverse the analysing Nicol prism $N_3$, which is
mounted within a divided circle so that its rotation in degrees can be
determined. The lens $L_2$ focuses the exit edge of the rhomboidal
prism R on the entrance slit of the spectrometer (usually a constant-deviation
instrument). This edge forms the dividing line between
the two adjacent beams as seen in the eyepiece of the spectrometer
telescope.




The intensities of the two beams 1 and 2 are equal before they
enter cells A or B. Let this intensity be $I_0$. Within cell A, the intensity
of beam 1 is reduced to $I_1$; within cell B, the intensity of beam
2 is reduced to $I_2$. Suppose $\theta$ is the angle through which the analysing
Nicol prism $N_3$ is rotated from the direction of the linearly polarised
light emerging from Nicol prism $N_1$ in beam 1. The intensity
of beam 1 after emerging from $N_3$ is therefore decreased to $I_1\cos^2\theta$
(absorption of light in the Nicol prisms and the rhomb R being neglected).
The intensity of beam 2 after emerging from $N_3$ will be
increased from zero (when $\theta = 0$, $N_3$ is crossed with respect to
$N_2$) to $I_2\cos^2(90° - \theta) = I_2\sin^2\theta$. The Nicol $N_3$ is rotated until
these intensities are equal. This observation of equality is made at
a given wavelength selected by the spectrometer through which
the intensities are observed. At equality,

$$I_1\cos^2\theta = I_2\sin^2\theta$$

or

$$\frac{I_1}{I_2} = \tan^2\theta$$

Where absorption cell A is empty, $I_1 = I_0$ and

$$\frac{I_0}{I_2} = \tan^2\theta$$

The divided circle attached to the analysing Nicol prism $N_3$
is usually marked both in degrees and also in optical densities $D$,
where

$$D = \log_{10}\frac{I_0}{I_2}$$

A curve can be plotted of the transmittance $I_2/I_0$ against wavelength
(as set by the spectrometer), or of optical density $D$ against wavelength.
This will give the absorption spectrum of the liquid in cell B.

This instrument can be adapted to measure at various wavelengths
the transmittance (or optical density) of a transparent solid
(such as a light filter) by placing a plane-parallel slab of the solid in
beam 2 in place of absorption cell B and omitting cell A. Also, if
prisms $P_1$ and $P_2$ and cells A and B are removed, the intensities of
two sources of light can be compared at various wavelengths
through the spectrum.
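
The double marking of the divided circle can be illustrated numerically. With cell A empty the balance condition is $I_0/I_2 = \tan^2\theta$, so the optical density is $D = \log_{10}\tan^2\theta = 2\log_{10}\tan\theta$. The sketch below tabulates this for a few hypothetical settings of the analyser; absorption in the Nicol prisms and the rhomb is neglected, as in the text.

```python
import math


def transmittance_from_angle(theta_deg):
    """I2/I0 = 1/tan^2(theta) at balance, with cell A empty."""
    return 1.0 / math.tan(math.radians(theta_deg)) ** 2


def density_from_angle(theta_deg):
    """Optical density D = log10(I0/I2) = 2*log10(tan(theta))."""
    return 2.0 * math.log10(math.tan(math.radians(theta_deg)))


def angle_for_density(d):
    """Analyser setting in degrees at which the circle should read density d."""
    return math.degrees(math.atan(10.0 ** (d / 2.0)))


for theta in (45.0, 60.0, 75.0, 85.0):  # hypothetical analyser settings
    print(theta,
          round(transmittance_from_angle(theta), 4),
          round(density_from_angle(theta), 3))
```

A setting of 45° corresponds to no absorption in cell B, and the density scale becomes increasingly crowded as $\theta$ approaches 90°.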



5.32 Modern Spectrophotometers 

The technology of the several kinds of advanced spectrophotometers
and absorption spectrophotometers used in modern practice
in the infra-red, visible and ultra-violet regions is beyond the scope 
of this book. For example, double-beam diffraction grating record- 
ing spectrophotometers with photoelectric detectors, amplifiers and 
pen recorder output giving the absorption spectrum directly make 
use of automatic drive of the grating setting in conjunction with 
a servomechanism to control the slit width. 

A Féry prism of quartz which has a concave entrance-surface
and a fully aluminised concave back-surface is used in the 'Unicam'
prism absorptiometer (Fig. 5.25). This type of prism is self-focusing.
It is mounted on an arm coupled to a drum directly calibrated
in wavelengths. Rotation of the drum orients the prism to a position
at which the radiation traversing the exit slit is monochromatic and of
known wavelength. This monochromatic radiation passes through
the sample or a standard. The sample is a liquid in a parallel-sided
cell, often of 1 cm length, or it may be a plane-parallel slab of a solid.
The standard is usually a pure solvent of known absorption
characteristics. The light emerging from the sample is incident
upon a photocell. The ratio of the outputs from the photocell (which
must have a linear response) with and without the sample in position
gives the transmission of the sample at the selected wavelength.

[Fig. 5.25. Optical paths in a Unicam prism absorptiometer. Labelled parts:
prism movement, prism arm coupled to drum drive, beam attenuators,
condenser lens, entrance slit, sample, photocell.]

Spectrophotometry practice in the near infra-red is illustrated
in outline by Fig. 5.26. Radiation from a Nernst filament is dispersed
by a quartz prism (for wavelengths exceeding 35,000 Å
a prism of a material selected from those listed in Table 5.3 may
be used) which can be rotated so that a small range of wavelengths
passes through a slit of adjustable width. The beam of monochromatic
light thus obtained is incident upon a beam-splitter (a thin
pellicle of plastic material vacuum-coated with a semi-transparent
aluminium layer may be used) so that one half of the radiation
passes through a reference optical density to detector 1 and the
other half via a mirror through an absorption cell to detector 2.



[Fig. 5.26. Outline of arrangement for spectrophotometry in the near infra-red.
Labelled parts: Nernst filament, rotatable quartz prism, slit of variable
width, beam-splitter, optical wedge, detector 1, absorption cell, detector 2.]

These detectors are Golay air cells. The reference optical density 
may be a neutral optical wedge (a plate along which the optical den- 
sity varies with distance in a known manner and independently of 
wavelength) or a rotating sector. 

Automatic recording may be introduced by rotating the prism 
by a cam mechanism, while at the same time switching the detectors 
at a frequency of 10 Hz (10 c/s); the difference in intensity between 
the sample and the reference beam then gives a proportional 10 Hz 
alternating output e.m.f. The neutral optical wedge may be circular 
and rotated by a servomechanism so that there is no difference in 
output between the two detectors. The servomechanism also con- 
trols the position of a pen recorder or a print-out mechanism; thus, 
as the prism is rotated by the cam mechanism to produce a change 
in wavelength, the transmittance of the sample is recorded continu- 
ously on paper. 

Indicative of the sophisticated techniques used in modern prac- 
tice in the far infra-red region of the spectrum is the outline given 
in Section 5.36. 



5.33 Microphotometers 

A microphotometer or microdensitometer is used for such work 
as the measurement of the optical densities of the black lines on a 



SPECTROSCOPY 237 

photographic negative which has been exposed in a spectrograph. 
The variation of density over the plate is large over very short dis- 
tances. In a microphotometer, there must therefore be produced a 
beam of light of very small cross-sectional area. This beam is passed 
through the photographic negative, and the reduction in its inten- 
sity due to the silver deposit in various locations in the negative is 
measured, usually by a photocell. 

To provide the narrow beam of light, a microscope objective is
used. A common practice is to project via this objective an image
of an illuminated slit on to the photographic negative. The light
transmitted through the negative impinges on a phototransducer
(often a photovoltaic cell or a phototransistor) placed close to
the negative. The beam of light is usually vertical, with the
negative supported horizontally in a mount; the phototransducer is
immediately beneath the negative, with the microscope objective
above. With a linear photocell, the output reading with clear glass
in the photographic plate is proportional to the intensity $I_0$,
and with the black silver deposit inserted this intensity is
reduced to $I$, giving a corresponding photocell output. The optical
density $D$ of the area of the negative explored is then given by

$$D = \log_{10}(I_0/I)$$
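As an illustration of the density calculation, the following is a minimal Python sketch. The function name and the sample readings are hypothetical, and it assumes (as the text does) a linear photocell, so that the ratio of the two readings equals the ratio of the intensities.

```python
import math


def optical_density(clear_reading, deposit_reading):
    """Optical density D = log10(I0 / I) from two linear photocell readings.

    clear_reading   -- output with clear glass in the beam (proportional to I0)
    deposit_reading -- output with the silver deposit in the beam (proportional to I)
    """
    if clear_reading <= 0 or deposit_reading <= 0:
        raise ValueError("photocell readings must be positive")
    return math.log10(clear_reading / deposit_reading)


# Hypothetical readings: the deposit passes 1% of the light, so D is about 2.
example_density = optical_density(100.0, 1.0)
```

Only the ratio of the readings matters, so the units of the photocell output are immaterial.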

The slit-shaped area of the photographic negative explored may 
be as small as 0.001 cm in width. The negative is supported firmly
in a mount which can be moved across the light beam by a microm- 
eter screw drive such that the exact position of the area of the 
negative can be read on a calibrated scale. Motion of a spectro- 
gram in one direction across the light beam is usual, but in some 
microphotometers a drive to impart motion in two perpendicular 
directions is provided. 

A precision instrument needs stabilised power supplies so that 
the current through the light source (which illuminates the slit) 
and any p.d. used in the photocell circuitry are both independent 
of fluctuations in the mains supply voltage. In more expensive 
microphotometers, the phototransducer output and the monitoring 
of the position of the negative holder (which is made to traverse 
slowly through the beam of light by a motor drive) are displayed 
on a pen recorder; a plot of optical transmission or density against 
wavelength is thus obtained directly and automatically. 

When the spectrograph used has adequate dispersion and chro- 
matic resolving power, the exploration of the spectrogram by a 
microphotometer is able to record fine structure invisible to the 
eye. A microphotometer capable of recording or detecting very 
small differences in light transmission must be contained in a light- 
tight housing, and care must be taken that only the light trans- 
mitted through the photographic plate is incident upon the photo- 
transducer. 



238 UNIVERSITY OPTICS 

5.34 Outline of Applications of Absorption Spectrophotometry 

5.34.1 THE CONCENTRATION OF A SOLUTION 

The concentration $c$ of a solution may be found by determining
the optical density $D$ at a suitable wavelength $\lambda$ at which
absorption is significant. If, for the same solute and solvent and
the same wavelength $\lambda$, the optical densities $D_1$ and $D_2$
of solutions of known concentrations $c_1$ and $c_2$ are found, the
unknown concentration is given from

$$\frac{c - c_1}{c - c_2} = \frac{D - D_1}{D - D_2}$$

This technique may be extended to a solution containing a 
number of different solutes. For each of the different solutes a 
suitable wavelength is chosen at which the absorption of radiation 
is pronounced, and this leads to the determination of the various 
concentrations. 
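The two-standard relation for a single solute can be sketched in Python as follows. This is only an illustration: it assumes that the optical density varies linearly with concentration over the range concerned (an assumption not stated explicitly in the text), in which case the relation is equivalent to linear interpolation between the two standards. The function name and the numerical values are hypothetical.

```python
def concentration_from_density(d, c1, d1, c2, d2):
    """Unknown concentration c from its optical density d.

    (c1, d1) and (c2, d2) are the known concentrations and measured
    optical densities of two standard solutions at the same wavelength.
    Assumes optical density is linear in concentration, so that
    (c - c1)/(c - c2) = (d - d1)/(d - d2) reduces to interpolation.
    """
    if d1 == d2:
        raise ValueError("the two standards must have different densities")
    return c1 + (c2 - c1) * (d - d1) / (d2 - d1)


# Hypothetical standards: 1.0 and 3.0 units with densities 0.2 and 0.6.
unknown = concentration_from_density(0.4, 1.0, 0.2, 3.0, 0.6)  # about 2.0
```

The result can be checked against the relation in the text: with these values both sides of the equation equal -1.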

5.34.2 THE INVESTIGATION OF CHEMICAL STRUCTURE 

One of the main fields of application of absorption spectrophotom- 
etry is the investigation of chemical structure. Absorption spectra 
resulting from electronic transitions are associated with photons of 
wavelengths in the visible and ultra-violet regions. At wavelengths 
above about 8,000 Å in the infra-red, the energies of the photons
in the incident radiation are insufficient to produce changes in 
electron energy levels; however, energy transitions associated with 
molecular vibrations and rotations are excited, in some cases far 
into the infra-red. 

If the specimen is gaseous, the atoms or molecules are relatively 
far apart except at high pressures. They may be considered isolated 
from one another in that the energy levels in one atom or molecule 
are scarcely affected by its neighbours. Fine details in the absorp- 
tion spectra are therefore observable. For specimens in the con- 
densed state (liquid, solid or in solution) the proximity effect of 
atoms or molecules on their neighbours becomes pronounced: 
energy levels are no longer discrete but overlap. The absorption 
spectrum detail is therefore no longer fine because lines become 
spread out into overlapping bands. 

Ultra-violet absorption spectroscopy is used considerably in 
studies of electronic transitions in organic chemistry. In the visible 
region, spectrophotometry is applied to dye-stuffs, light filters and 
studies of gaseous discharges. A famous application in astronomy is
the investigation of the Sun's chromosphere. Over 500 absorption
lines — the Fraunhofer lines — have been identified in the contin- 
uous solar spectrum, and these have led to a detailed knowledge 
of the constituents in the Sun's chromosphere. 

Gases and liquids absorb radiation of certain wavelengths
characteristic of the substance, especially in the infra-red region.
This allows rapid chemical analysis in the identification of
substances and, as noted in Section 5.34.1, the quantitative
estimation of the amounts of these substances. Infra-red gas
analysers make use of this principle. Table 5.7 gives the smallest
concentration of certain gases which can be measured with not more
than 2% error.

Table 5.7.

SMALLEST CONCENTRATION OF GASES WHICH CAN BE MEASURED WITH NOT
MORE THAN 2% ERROR

Gas                    Concentration (vol. %)

CO                     0.05
CO₂                    0.005
CH₄                    0.1
C₂H₂                   0.1
C₂H₄                   0.05
C₂H₆                   0.1
Vapours of solvents    0.1

Infra-red radiation of a wavelength selected as being suitable 
for a particular gas is divided so that a portion passes through an 
absorption cell filled with dry air while the other portion is passed 
through a similar absorption cell into which the gas sample is 
passed. Radiation detectors at the exit of each cell may be so
connected that their combined output is fed to a meter calibrated
directly in gas concentration. Variations in the carbon dioxide
content of the atmosphere (normally about 0.03%) due, for example,
to sunlight, to bacterial action in soils or to plant life, may be
satisfactorily investigated. Combustion processes in petrol and 
diesel motors can be recorded. 

Infra-red absorption methods are also used to investigate organic 
compounds of biochemical and biological interest. As a qualitative 
example, plant pigments in minute quantities can be identified. 
As a quantitative example, the strength of a penicillin solution may 
be found by absorption. 

The interpretation of data on chemical constitution is not always
easy, as several groups may have close absorption frequencies
extending over a range which makes their resolution difficult. The
type of solvent can sometimes affect the solute absorption
wavelengths and, for example, the absorption wavelengths due to a
double bond may not be the same for an aldehyde as for a ketone.
Halogens, sulphur, silicon and phosphorus in organic compounds can
be identified, and a whole range of compounds (such as pyridine,
naphthalene, anthracene and olefins) have characteristic absorption
wavelengths. Some characteristic absorption wavelengths are given
in Table 5.8.

[Table 5.8. Characteristic absorption wavelengths]



5.34.3 FLASH PHOTOLYSIS 

The progress of chemical reactions with time may be followed by 
absorption spectrophotometry. A useful technique is that of flash 
photolysis: an intense flash of activating radiation impinges on the 
chemical; the photochemical products resulting are short-lived; 
they are identified, and their decay is studied by recording absorp- 
tion spectra. 



5.35 Astronomical Applications of Spectroscopy 

Spectral observations have been of immense value to astronomers. 
Much of the information on the nature of the Universe is a con- 
sequence of the precision with which astrophysicists have been 
able to measure the wavelengths of spectral lines. 

Information on stellar and solar velocities is obtained from
measurements of the Doppler effect. The velocity of light in free
space $c$ is constant. A radiating body which is receding from the
Earth at a velocity $v$ will radiate wavelengths appropriate to its
composition, but these wavelengths will be longer than those
radiated from the same elements on Earth. The apparent wavelength
$\lambda'$ is given by

$$\lambda' = \lambda\left(1 + \frac{v}{c}\right)$$

where $\lambda$ is the wavelength measured when $v = 0$. Observations
of this 'red shift' show that the more distant nebulae are moving
away from the Earth faster than near nebulae. This leads to the
conclusion that the Universe is expanding more rapidly at its
extremities. Red shifts of 200 Å have been observed.
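The first-order Doppler relation can be turned round to give the velocity of recession from a measured shift. The Python sketch below is illustrative only: the function name is hypothetical, the rest wavelength of 4,000 Å is an assumed value chosen for the example (the text quotes only the 200 Å shift), and the formula holds only while $v$ is small compared with $c$.

```python
C_KM_PER_S = 299_792.458  # velocity of light in free space, km/s


def recession_velocity(rest_wavelength, shift):
    """Velocity of recession (km/s) from the first-order Doppler formula.

    rest_wavelength -- wavelength measured for a source at rest
    shift           -- observed increase in wavelength (same units)
    Uses shift / rest_wavelength = v / c, valid only for v << c.
    """
    if rest_wavelength <= 0:
        raise ValueError("rest wavelength must be positive")
    return C_KM_PER_S * shift / rest_wavelength


# Assumed example: a 200 angstrom red shift of a line at 4,000 angstroms.
v_example = recession_velocity(4000.0, 200.0)  # about 15,000 km/s
```

A shift of this size on a visible line therefore corresponds to a recession velocity of about one-twentieth of the velocity of light.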

By observing periodic variations in the wavelength of light from 
double stars, their velocity and period of rotation can be measured. 
Similarly, the rings of Saturn are shown to be non-rigid as spec- 
troscopy evinces that the inner portions move faster than the outer 
ones. 

The temperature of stars may also be estimated from the broadening
of spectral lines due to the Doppler effect. Atoms radiating at high
temperatures exhibit broader spectral lines than those at low
temperature, because they have greater translational velocities. An
atom moving at a speed of 3 km s⁻¹ will appear to have a wavelength
differing by one part in 100,000 from the radiation from a
stationary atom. This is equivalent to 0.059 Å for the sodium
5,896 Å spectral line. Thus r.m.s. speeds, which are proportional
to the square root of the absolute temperature, may be calculated
for elements in the star from the width of their spectral lines and
the temperature may be compared with that estimated from the
radiation laws.
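The figures quoted for sodium can be checked with a short Python sketch. It rests on assumptions that go beyond the text: the quoted speed is treated as the r.m.s. speed, the kinetic-theory relation $\tfrac{1}{2}mv^2 = \tfrac{3}{2}kT$ is used to convert it to a temperature, and the function names are hypothetical. It is a rough estimate, not a full treatment of line profiles.

```python
C_M_PER_S = 2.998e8       # velocity of light, m/s
BOLTZMANN = 1.380649e-23  # Boltzmann constant, J/K
ATOMIC_MASS = 1.66054e-27 # atomic mass unit, kg


def doppler_shift(wavelength, speed):
    """First-order wavelength shift for a line-of-sight speed in m/s."""
    return wavelength * speed / C_M_PER_S


def temperature_from_rms_speed(speed, relative_atomic_mass):
    """Temperature (K) from (1/2) m v^2 = (3/2) k T, with v the r.m.s. speed."""
    mass = relative_atomic_mass * ATOMIC_MASS
    return mass * speed ** 2 / (3.0 * BOLTZMANN)


# Sodium line at 5,896 angstroms, atom moving at 3 km/s.
shift_angstrom = doppler_shift(5896.0, 3.0e3)                 # about 0.059
sodium_temperature = temperature_from_rms_speed(3.0e3, 22.99)  # roughly 8,300 K
```

Because the temperature depends on the square of the speed, halving the line width reduces the estimated temperature by a factor of four.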

Measurements of the wavelengths in the emission spectra of stars 
permit their constituent elements to be determined. From the 
absorption spectra, the composition of the atmospheres of stellar 
bodies is found. 

The splitting of spectral lines due to the Zeeman effect is utilised 
to investigate distant magnetic fields in space. The Stark effect 
can be used to detect strong electric fields near radiating atoms: 
a potential gradient of the order of 10⁵ V cm⁻¹ is necessary before a
measurable change in wavelength occurs. 

A slight shift in the position of spectral lines gives information 
on high stellar densities. Otherwise, density can only be estimated 
from observations on the motion of double stars. 



5.36 Advances in Spectroscopy in the Far Infra-red 

It is not possible to be specific about the limits of the far infra-red 
region, but generally it is considered to lie between wavelengths
of 40 µm and 2,000 µm. Spectroscopists prefer to specify wave
numbers ($\nu = 1/\lambda$) rather than wavelengths, so in wave numbers
the region is between 250 cm⁻¹ and 5 cm⁻¹. Though studies in
this spectral region began more than 50 years ago, progress was 
slow, primarily because of the lack of an intense broad-band source 
for absorption spectroscopy. In this region, the radiation suffers 
massive absorption in water vapour. Spectrometers used must 
therefore be evacuated. 
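The conversion between wavelength and wave number used above is easily sketched in Python. The function names are hypothetical; the only fact used is that 1 cm equals 10⁴ µm.

```python
def wave_number_per_cm(wavelength_um):
    """Wave number in cm^-1 for a wavelength given in micrometres."""
    if wavelength_um <= 0:
        raise ValueError("wavelength must be positive")
    return 1.0e4 / wavelength_um


def wavelength_um(wave_number_cm):
    """Wavelength in micrometres for a wave number given in cm^-1."""
    if wave_number_cm <= 0:
        raise ValueError("wave number must be positive")
    return 1.0e4 / wave_number_cm


# Limits of the far infra-red region quoted in the text.
short_limit = wave_number_per_cm(40.0)    # 250 cm^-1
long_limit = wave_number_per_cm(2000.0)   # 5 cm^-1
```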

Suitable detectors for far infra-red radiation were lacking until 
the Golay cell (Section 5.15) appeared in the 1950s; from then 
onwards, progress has been rapid. Impetus to research comes from 
the fact that simple solids and molecular compounds exhibit im- 
portant spectra in this region. Current research arising from such 
spectral studies includes investigations of: the lattice vibrational 
properties of crystals as functions of temperature and pressure; 
the optical and dielectric properties of solids; the rotational spectra 
of molecules; the intermolecular motions of large molecules; ex- 
change interactions between magnetic ions ; ferroelectric transitions ; 
semiconductors and superconductors; and the vibrational spectra 
of liquids. 




The region has been approached in two ways: from the shorter 
wavelength end by the extension of infra-red techniques and from 
the longer wavelength end by the extension of microwave radio 
methods. The former has the advantage over conventional spectros- 
copy that the precision demanded of the optical components is 
not so great as in the visible region, but the disadvantage that 
suitable transparent materials are few in number and have poor 
dispersion properties. The use of radio methods, necessitating very 
small resonant cavities for submillimetre wavelengths, is more 
demanding on precision than work at normal millimetre and centi- 
metre wavelengths. The region in which the optical and micro- 
wave 'radio' methods merge employs techniques known as 'quasi- 
optical'. Submillimetre methods utilising tuned cavities, wave- 
guides and electron beam sources are not to be described in this 
book. 

Apart from the submillimetre microwave methods, the two 
techniques practised in far infra-red spectroscopy are the use of 
diffraction gratings in the form of echelles and echelettes (Section 
5.24), and Fourier transform spectroscopy utilising interferome- 
ters. 



5.37 Grating Spectroscopy in the Far Infra-red 

The broad-band source of radiation used is a hot body with
black-body type spectral distribution. The most effective is the
quartz-enveloped mercury arc lamp. At wave numbers above 100 cm⁻¹
the radiation is emitted by the hot quartz envelope; at wave
numbers below 100 cm⁻¹, the radiation is predominantly from the
mercury discharge.

A typical monochromator is that due to Czerny and Turner
(Fig. 5.27), and Littrow models are also used. The echelle
diffraction grating employed is usually 10 cm × 10 cm, blazed at an
angle of 20° and with 15-120 rulings per centimetre. The spherical
mirrors for the 'collimator' and 'telescope' are coated with gold,
silver or aluminium and have a reflectivity of 98.5% in the far
infra-red region. Window materials (and in further developments,
lenses, light pipes and light cones) transparent to the radiation
are frequently of polyethylene, but quartz, diamond, alkali halides
and other ionic crystals (Section 5.18) are used over select regions
with silicon and germanium at low wave numbers.

The dominant problem is to prevent the detector from responding to
unwanted radiation and ensure that it receives only radiation from
the source. Such unwanted radiation arises from the components and
walls of the spectrometer; furthermore, high-energy radiation from
higher orders appearing in the first order (the one normally used)
of the diffraction pattern is a particular source of difficulty.

To avoid this unwanted radiation, two procedures are adopted.
The first is to chop mechanically the radiation from the source
(usually done at 13 Hz) and tune the detector amplifier to respond
only to signals at this frequency. This avoids output signals due
to radiation other than from the source itself. The second is to
employ filters to reduce the unwanted radiation from higher orders,
which may reach the first order. Such filters are: absorption or
transmission types which make use respectively of the strong
absorption bands in ionic crystals or the scattering and diffraction
properties of polyethylene transmission gratings for wavelengths
less than the grating space; gratings and metal meshes for which
radiation of wavelength greater than the mesh spacing is specularly
reflected; Reststrahlen types which utilise the strong reflection
band of ionic crystals; and Fabry-Perot types made of parallel
metal meshes which can be made to isolate a narrow spectral
region.

Fig. 5.27. Outline of a Czerny-Turner monochromator for the far infra-red
(labels: incident radiation from quartz-enveloped mercury lamp via
absorber under study; spherical mirrors; to detector)
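The effect of chopping the source and tuning the amplifier to the chopping frequency can be illustrated numerically. The Python sketch below is a simplified model, not a description of any particular instrument: the chopped signal is idealised as a sine wave of known phase, and the steady background, the 50 Hz interference, the sample rate and the function names are all assumptions introduced for the example.

```python
import math


def detector_output(k, sample_rate, source_amplitude, chop_hz=13.0):
    """Simulated detector sample k: chopped source plus unwanted signals."""
    t = k / sample_rate
    source = source_amplitude * math.sin(2.0 * math.pi * chop_hz * t)
    background = 5.0                                 # steady stray radiation
    mains = 2.0 * math.sin(2.0 * math.pi * 50.0 * t)  # assumed 50 Hz pick-up
    return source + background + mains


def tuned_amplitude(samples, sample_rate, chop_hz=13.0):
    """Amplitude of the component at chop_hz, in phase with the chopper.

    Multiplies by a reference sine at the chopping frequency and averages;
    components at other frequencies average to zero over whole periods.
    """
    total = sum(s * math.sin(2.0 * math.pi * chop_hz * k / sample_rate)
                for k, s in enumerate(samples))
    return 2.0 * total / len(samples)


RATE = 1000.0  # samples per second (assumed)
samples = [detector_output(k, RATE, 0.3) for k in range(1000)]  # one second
recovered = tuned_amplitude(samples, RATE)  # close to 0.3
```

Although the unwanted signals here are many times larger than the chopped source, only the 13 Hz component survives the averaging, which is the purpose of the tuned amplifier.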

The Golay cell is widely used. Other detectors employed include
photoconductive indium antimonide or germanium at 4.2°K,
point-contact crystal rectifiers, bolometers, superconducting
Josephson tunnel diodes, and pyroelectric thermal devices.



5.38 Fourier Spectroscopy 

This sophisticated and expensive technique for obtaining the power 
spectrum of radiation was first used in 1911 by Rubens and has 
been reborn following the work of Fellgett in 1951. It is tending 
to overhaul grating spectroscopy because of the following advan- 
tages: 




1. The resolution of the grating spectrometer in the far infra-red 
is limited by slit width; this does not apply to Fourier spec- 
troscopy, so much higher resolution is achievable. 

2. At any one time, the whole spectral range is incident on the 
detector in Fourier spectroscopy whereas in grating methods 
only a narrow width of the spectrum is incident. Not only 
does this mean a greatly improved signal-to-noise ratio, but 
it also enables recording to be much more rapid. 

3. The filtering of unwanted radiation is not so stringent as in 
grating spectroscopy. 

Disadvantages are that the spectrum is not measured directly, 
that the procedure is not very suitable for very narrow range 
investigations, and that the technique is expensive. 

A two-beam interferometer is used, usually with a beam-splitter 
made from a melinex membrane. The flux $I_x$ of infra-red radiation
reaching the central fringe is a function of the path difference $x$
between the two beams. (This compares with the use of the Michelson
interferometer to determine the fine structure and breadth of
spectral lines by the 'visibility' technique.) This function contains a
constant term and an oscillatory term $F_x$. The flux $I_x$ is called the
interferogram. This interferogram is characteristic of the incident 
spectrum of radiation and can be analysed to yield the spectral 
distribution. 

This procedure is known as the 'multiplex' method, implying the 
simultaneous detection of a broad spectral region of undispersed 
radiation with a single detector. The intensity of each spectral
component of width $\delta\nu$ cm⁻¹ (in wave numbers) in a broad
spectrum is coded by the specific orthogonal code of sine and cosine
simple harmonic functions. The intensity of each spectral component
is therefore modulated sinusoidally at a frequency proportional
to its wave number $\nu$ cm⁻¹. The width of the component depends
on the duration of the modulation.

Amplitude division, wavefront division and polarising inter- 
ferometers have all been used for multiplexing. From such instru- 
ments, the observed interferogram is a spatial frequency represen- 
tation of the spectral function. Consequently, analogue or digital 
computer Fourier transformation must be undertaken to obtain 
the power function.

If a two-beam interferometer is illuminated with monochromatic
radiation of intensity $I_0$ and frequency $\nu$,

$$I_x = I_0(1 + \cos 2\pi\nu x)$$

For a spectral input represented by $I(\nu)$,

$$I_x = \int_0^\infty I(\nu)(1 + \cos 2\pi\nu x)\,d\nu = \tfrac{1}{2}I(0) + \int_0^\infty I(\nu)\cos 2\pi\nu x\,d\nu$$

where $I(0)$ is the intensity at zero path difference. Application of the
Fourier integral theorem for the even function $I_x$ gives the spectrum

$$I(\nu) = 4\int_0^\infty \left[I_x - \tfrac{1}{2}I(0)\right]\cos 2\pi\nu x\,dx$$

in terms of the measured $I_x$. This is the fundamental relation of
Fourier spectroscopy. The interferogram is measured as a function
of path difference, and the spectrum is computed by analogue or
digital means.

In the periodic method, the path difference is varied rapidly with 
a saw-tooth motion of period $2T$. Each resolution width of the
spectrum appears as an audiofrequency harmonic of fundamental
frequency $1/T$. The spectrum is recovered by passing the signal
from the far infra-red detector through an amplifier of bandwidth
less than $1/T$, and which is tuned to each harmonic in turn. Such
spectrometers lack the advantage of obtaining the whole spectral 
range of interest at one time (called the multiplex advantage), 
because they sample only one harmonic in the interferogram at a 
time. 

The alternative aperiodic method is preferred. The interferogram 
is recorded once, and the spectrum computed. The interferogram 
is sampled at small uniform intervals of path difference. The Fourier 
integral is approximated by a sum in the computer. 
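The aperiodic method can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the procedure of any particular instrument: the input spectrum is taken to be two hypothetical monochromatic lines, the interferogram is synthesised from the two-beam formula rather than measured, and the Fourier integral is replaced by a trapezoidal sum over a finite maximum path difference. All names and numerical values are invented for the example.

```python
import math


def interferogram(lines, x):
    """I_x for monochromatic lines [(wave number in cm^-1, intensity), ...]."""
    return sum(i0 * (1.0 + math.cos(2.0 * math.pi * nu * x)) for nu, i0 in lines)


def spectrum_at(samples, dx, nu):
    """Trapezoidal estimate of 4 * integral [I_x - I(0)/2] cos(2 pi nu x) dx.

    samples -- I_x at path differences 0, dx, 2*dx, ... (dx in cm)
    nu      -- wave number (cm^-1) at which the spectrum is evaluated
    """
    half_i_zero = 0.5 * samples[0]
    last = len(samples) - 1
    total = 0.0
    for k, i_x in enumerate(samples):
        weight = 0.5 if k in (0, last) else 1.0
        total += weight * (i_x - half_i_zero) * math.cos(2.0 * math.pi * nu * k * dx)
    return 4.0 * dx * total


LINES = [(50.0, 1.0), (120.0, 0.5)]  # hypothetical lines in the far infra-red
DX = 0.001                           # sampling interval of path difference, cm
SAMPLES = [interferogram(LINES, k * DX) for k in range(1001)]  # 0 to 1 cm

peak_50 = spectrum_at(SAMPLES, DX, 50.0)    # about 2.0
peak_120 = spectrum_at(SAMPLES, DX, 120.0)  # about 1.0
between = spectrum_at(SAMPLES, DX, 85.0)    # about 0
```

The two recovered peaks stand in the ratio 2:1, as do the intensities of the input lines. Their absolute heights scale with the maximum path difference used, which also fixes the width of each recovered line (here about 1 cm⁻¹).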



Exercise 5 

1. Write an account of the chief characteristics of photographic emulsions in 
relation to their use in recording radiation. 

2. Write an essay on the various techniques used in spectroscopy to record 
infra-red radiation. 

3. Describe, with a diagram, a thermal Golay cell for measurements of infra- 
red radiation and compare its performance with that of the bolometer and 
of the thermopile. 

4. Describe briefly the factors in spectroscopic sources which give rise to line 
broadening and show how each factor may be minimised. 

Describe a source and a dispersive instrument capable of resolving the 
hyperfine structure of the sodium D lines. (Poly.) 

5. Explain how a plane diffraction grating is used in spectroscopy and derive 
the relation which gives the angular positions of the principal maxima when 
monochromatic light is incident obliquely. 

A wide reflection grating having 3000 lines cm⁻¹ is used in a
monochromator, the angle between the incident beam and the diffracted
beam being fixed at 10°. The grating which is adjusted to be normal
to the incident beam is illuminated with white light and a lens of
focal length 50 cm is used to focus the diffracted beam on the exit
slit which is 0.1 cm wide. Determine the mean wavelength and range
of wavelengths of the visible light emerging from the slit. (L.G.)

6. Describe one form of prism spectrograph suitable for use in the UV down 
to 2300 Å. Explain the choice and use of the prism and any lens included.
Explain the use of such an instrument with photographic plates either to 
identify elements present in a metal alloy or to determine the proportion of 
a particular element present in each of a series of closely similar alloys. 

(L.Anc.) 

7. Write a short essay on the precise measurement of the wavelength of a stan- 
dard line in the spectrum. (L.G.) 

8. Explain the formation of a spectrum by means of a plane diffraction grating 
used either for transmission or for reflection. Describe the arrangement of 
a grating spectrometer for the examination of absorption spectra in the 
infra-red region explaining the function of each part. Mention (a) a suitable 
source and (b) a suitable detector for the radiation. (L.Anc.) 

9. Write an essay on spectroscopic techniques used in either (a) the ultra-violet 
or (b) the infra-red regions of the spectrum. (L.P.) 

10. Give a general account of the methods used for obtaining emission spectra, 
including the investigation of fine structure. What is the significance of the 
breadth of the lines? (L.Anc.) 

11. Parallel light is incident on a parallel-sided block of thickness $t$ and
refractive index $\mu$. If the angle of refraction in the medium is $r$ show
that the phase difference between the directly transmitted ray and the ray
reflected once at each surface is $4\pi\mu t \cos r/\lambda$ where $\lambda$
is the wavelength in vacuo.
Describe briefly a method of calibrating a spectroscope by the use of a thin
air film between parallel plates. How may fringes of good visibility be
obtained? For a film of thickness 0.01 mm, a bright fringe coincides with
light of wavelength 6250 Å. At what shorter wavelengths will the 1st and
10th fringes from 6250 Å occur? (L.G.)

12. Give a brief account of the ways by which the spectroscopy of electromag- 
netic radiation has been extended to regions beyond the visible. Describe 
in detail the techniques involved in one region. (L.P.) 

13. Give an account of some form of infra-red spectrometer including a descrip- 
tion of (a) the method of recording the spectrum, (b) the determination of 
wavelength. Discuss briefly one important application of investigations in 
the infra-red. (L.G.) 

14. Write an essay on the methods used in infra-red spectroscopy, and the infor- 
mation about molecules that can be obtained from infra-red absorption 
spectra. (L.Anc.) 

15. Explain how the Doppler effect influences the radiation received from
terrestrial and astronomical sources.

What steps can be taken to minimise its effect in terrestrial sources and to 
what useful information does its presence lead in astro-physics? (L.P.) 




16. How may an interferometer be used for accurate determination of the wave- 
lengths of spectral lines from a source which has many lines in the visible 
region? Give a careful sketch of the apparatus including the approximate 
physical dimensions of all components of the system assuming that 10⁻² Å
resolution is required. 

Calculate the approximate size of diffraction grating which would give the 
same resolution. (M.P.) 



CHAPTER 6 



More Advanced Treatment 
of Diffraction and Interference 



6.1 The Diffraction of Electromagnetic Waves 

If it is assumed that radiation is propagated in straight lines in a 
uniform isotropic medium, an obstacle placed between a source of 
radiation and a receiving screen will result in a shadow at this 
screen of shape and dimensions decided by simple geometrical 
considerations. Taking into account that the radiation has a finite 
wavelength, however, it is established that the radiation spreads 
into this geometrical shadow because of the phenomenon of 
diffraction. This is explained in the simplest instances by Huygens' 
principle (Section 7.6 of Volume 1). It is assumed that all points 
on the wavefront in an aperture (or around an obstacle) act as 
secondary sources from which the energy is radiated in the form of 
secondary wavelets without phase change to points on the side of 
the aperture (or obstacle) remote from the primary source. This 
gives rise to an intensity distribution which forms the spatial 
diffraction pattern for the particular aperture or obstacle and 
wavefront. 

In a more general sense, an electromagnetic wave incident on a 
shield containing an aperture will give rise to effects both within 
the aperture and on the shield. The secondary sources within the 
aperture will radiate in a complex manner with differing amplitudes, 
states of polarisation and phases not only on the side of the aperture 
remote from the primary source but also on the primary source side 
as well. A complete treatment of diffraction to deduce the spatial 
intensity distribution of light requires that all these effects be taken 
into account. Fortunately, in practice, it is only necessary to deter- 
mine effects very close to a shield or aperture. 




If far-field paraxial effects with a symmetrical aperture are being 
considered, as often happens in practice, polarisation and shield 
effects can be neglected and adequate results obtained by the method 
of Kirchhoff (Section 6.15). This method takes into account the 
phase change on re-radiation (i.e. from the secondary sources) 
and the obliquity of a secondary source with respect to the point 
at which intensity is to be determined. If the former of these fac- 
tors—the phase change — is also neglected and allowance is made 
in only a general and approximate manner for the latter factor— 
the obliquity— the comparatively simple Fresnel treatment is valid. 
This Fresnel procedure is nevertheless adequate for most optical 
diffraction in the visible region. The Fresnel method is therefore 
described first, and the more advanced procedures which take into 
account shield effects, phase and polarisation changes are left till 
later. 



6.2 Fresnel's Treatment of a Plane Wavefront in a Circular 
Aperture 

A uniform collimated beam of monochromatic radiation of wavelength
$\lambda$ is incident normally on a circular aperture in a shield.
The wavefront within this aperture is therefore plane. Let O be
the centre of the aperture, and P a point on the axis at a distance
$b$ beyond the aperture centre (Fig. 6.1). It is required to find an
approximate value for the amplitude of the wave at P. This amplitude
will arise from the interference of wavetrains from uniformly
distributed secondary sources over the wavefront W in the plane
yOz, where Ox is the direction of the incident light.

Fig. 6.1. Fresnel's treatment of a plane wavefront in a circular aperture

The wavefront may be conveniently divided into very narrow
concentric annuli about the point O. For a particular annulus
(shown shaded in Fig. 6.1), its contribution to the amplitude at
P will depend on the area of the annulus and the amplitude at the
wavefront W, and on the distance $p$ from any point Q on this
annulus to P, where intensity (proportional to the square of the
amplitude) falls off in accordance with the inverse square law.
The phase of the wavetrain from this annulus may be conveniently
measured relative to the phase of a wavetrain from a very small
area at O. It will thus be $(2\pi/\lambda)(p - b)$.

The obliquity of any point Q in the wavefront W with respect
to the point P is measured by the angle between the line QP and
the normal NQ to the wavefront at Q. If this angle $\theta$ is not too
large for a particular annulus, its effect may be neglected;
furthermore, the inverse square law effect will be constant, as will
be the amplitude over the wavefront.

That is, in estimating the relative effect at P due to the annulus, a
vector may be drawn having a length directly proportional to the
area of the annulus and a phase angle of $(2\pi/\lambda)(p - b)$. For a
continuous range of such annuli, the vectors representing the
amplitude at point P due to the very thin annuli become part of the
arc of a circle. The resultant effect is then represented by the
chord which joins the ends of the arc.

As a matter of convenience, Fresnel divided the wavefront into 
'half-period' zones. The distance $p$ to the extremity of each
succeeding half-period zone from the centre outwards increases by half
a wavelength [Fig. 6.2(a)]. The effects of the very thin annuli at
the extremities of a given zone thus differ in phase angle by $\pi$.
The diagram representing the addition of the successive vectors for 
the very thin annuli comprising such a half-period zone is shown 
in Fig. 6.2(b). 

Fig. 6.2. Fresnel's half-period zones: (a) the first zone and the
perimeters of the second and third zones; (b) the separate vectors for
the annuli comprising a half-period zone, with the resultant vector for
the zone (the vector from A is in anti-phase with the vector
terminating at B)

Fig. 6.3. The areas of Fresnel zones

The area of an annular zone with an inner radius $r$ and outer
radius $r + \delta r$ and where $p$ increases to $p + \delta p$ (Fig. 6.3)
is $2\pi r\,\delta r$. Now

$$p^2 = b^2 + r^2$$

Differentiating with respect to $r$, where $b$ is constant,

$$2p\,\frac{dp}{dr} = 2r$$

Therefore

$$r\,\delta r = p\,\delta p$$

Hence the zone area $= 2\pi p\,\delta p$.
For the $n$th Fresnel half-period zone,

$$p = b + \frac{n\lambda}{2}$$

and

$$\delta p = \frac{\lambda}{2}$$

The area $\delta S$ of the $n$th zone is therefore given by

$$\delta S = 2\pi\left(b + \frac{n\lambda}{2}\right)\frac{\lambda}{2} = \pi b\lambda + \frac{n\pi\lambda^2}{2}$$

If the term in $\lambda^2$ is neglected,

$$\delta S = \pi b\lambda \qquad (6.1)$$

which is a constant to the first order of small quantities.
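The near-constancy of the zone areas can be checked numerically. The Python sketch below computes the outer radius of the $n$th half-period zone from $p = b + n\lambda/2$ and $r^2 = p^2 - b^2$, and compares the exact area of each annular zone with $\pi b\lambda$. The function names and the values $b = 1$ m and $\lambda = 500$ nm are assumptions chosen for illustration.

```python
import math


def zone_radius_squared(n, b, wavelength):
    """Square of the outer radius of the nth half-period zone.

    From p = b + n*wavelength/2 and r^2 = p^2 - b^2, expanded to avoid
    subtracting two nearly equal numbers.
    """
    return n * b * wavelength + (n * wavelength / 2.0) ** 2


def zone_radius(n, b, wavelength):
    """Outer radius of the nth half-period zone."""
    return math.sqrt(zone_radius_squared(n, b, wavelength))


def zone_area(n, b, wavelength):
    """Exact area of the nth zone (annulus between perimeters n-1 and n)."""
    outer = zone_radius_squared(n, b, wavelength)
    inner = zone_radius_squared(n - 1, b, wavelength)
    return math.pi * (outer - inner)


B = 1.0            # distance OP in metres (assumed)
WAVELENGTH = 5e-7  # 500 nm (assumed)

first_radius = zone_radius(1, B, WAVELENGTH)  # about 0.71 mm
ratios = [zone_area(n, B, WAVELENGTH) / (math.pi * B * WAVELENGTH)
          for n in range(1, 11)]              # each very close to 1
```

For these values the first ten zone areas exceed $\pi b\lambda$ by only a few parts in a million, the excess growing slowly with $n$, in keeping with the neglect of the term in $\lambda^2$.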

As the effects of neighbouring half-period zones differ in phase 
by $\pi$ and their areas are equal, it follows that the resultant effect
of adjacent zones will be nearly equal and opposite. In fact, as n 
increases, the zone areas increase slightly, but they become pro- 
gressively farther away from P; these two factors can be shown to 
balance out. Therefore, apart from the obliquity factor, each zone 
does give at P a contribution of the same amplitude. The effect of 
the obliquity factor is to cause the amplitudes due to successive 
zones to decrease gradually. If, therefore, the amplitudes of the
vibration at P due to zones 1, 2, 3, . . ., $n$ are
$A_1, A_2, A_3, \ldots, A_n$, the total amplitude $A$ due to $n$ zones
at P will be given by

$$A = A_1 - A_2 + A_3 - A_4 + \ldots \pm A_n \qquad (6.2)$$

The alternation of signs is due to the fact that the amplitude 
contributions from successive zones are in anti-phase. 
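If $n$ is odd, this series is conveniently rearranged to become

\[ A = \frac{A_1}{2} + \left(\frac{A_1}{2} - A_2 + \frac{A_3}{2}\right) + \left(\frac{A_3}{2} - A_4 + \frac{A_5}{2}\right) + \dots + \frac{A_n}{2} \tag{6.3} \]

or

\[ A = A_1 - \frac{A_2}{2} - \left(\frac{A_2}{2} - A_3 + \frac{A_4}{2}\right) - \left(\frac{A_4}{2} - A_5 + \frac{A_6}{2}\right) - \dots - \frac{A_{n-1}}{2} + A_n \tag{6.4} \]

As the terms $A_1, A_2, A_3, \dots$ decrease progressively, each term is
greater than the mean value of the terms preceding and following
it. The quantities in brackets in Equations 6.3 and 6.4 are therefore
all negative. It follows that

\[ \frac{A_1}{2} + \frac{A_n}{2} > A > A_1 - \frac{A_2}{2} - \frac{A_{n-1}}{2} + A_n \]

As $A_1$ and $A_2$ are very nearly equal, as also are $A_{n-1}$ and $A_n$,

\[ A = \frac{A_1}{2} + \frac{A_n}{2} \tag{6.5} \]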

The same argument and result applies if the value of n is taken 
to be even. 

The conclusion is that the resultant amplitude A at P due to n 
zones is equal to half the sum of the amplitudes contributed by 
the first and the nth zone, where the nth zone will be the last one 
in a given aperture. 
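The rearrangement leading to Equation 6.5 can be checked numerically. The sketch below sums the alternating series of Equation 6.2 for an odd number of zones and compares it with half the sum of the first and last amplitudes. The linear fall-off of the amplitudes is purely an assumed stand-in for the obliquity factor; it is the limiting case in which the brackets of Equations 6.3 and 6.4 vanish.

```python
def resultant_amplitude(amplitudes):
    """Equation 6.2: A = A1 - A2 + A3 - ... with alternating signs."""
    total = 0.0
    for index, amplitude in enumerate(amplitudes):
        total += amplitude if index % 2 == 0 else -amplitude
    return total

def slowly_decreasing_amplitudes(n, first=1.0, step=0.001):
    """Hypothetical zone amplitudes that decrease gradually with zone number."""
    return [first - step * i for i in range(n)]

if __name__ == "__main__":
    zones = slowly_decreasing_amplitudes(101)       # odd number of zones
    series_sum = resultant_amplitude(zones)
    half_sum = 0.5 * (zones[0] + zones[-1])          # Equation 6.5
    print(series_sum, half_sum)
```

With these assumed amplitudes the two printed values agree to within rounding error, although 101 separate zone contributions have been added.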

Two cases of particular interest arise: 

1. The radius of the aperture is of significant size compared
with the distance OP, so the number of half-period zones $n$ is
very large. As the obliquity angle (Fig. 6.1), which decides the
obliquity factor for this final zone, will be large, the contribution
of the $n$th zone is very small compared with that of the first one
(i.e. $A_n \ll A_1$). Equation 6.5 shows that the resultant
amplitude at P will then be only slightly larger than half the
amplitude due to the first zone alone.

2. The radius of the aperture is small compared with the distance
OP so that the number of half-period zones $n$ is not very large.
Equation 6.5 will apply. If, however, the number of zones $n$
is even, $A_n$ will lie in the opposite direction to $A_1$ (i.e. $A_n$ and
$A_1$ are in anti-phase), so

\[ A = A_1 - A_n \]

and as $A_n$ is not much smaller than $A_1$, $A$ will be small. On the
other hand, if $n$ is odd, $A_n$ will be in the same direction as
$A_1$ ($A_n$ and $A_1$ are in phase) so

\[ A = A_1 + A_n \]

and $A$ will be large.

For an aperture of given radius, the variation of amplitude 
with distance b = OP as the point P in question on the axis moves 
away from O is represented by Fig. 6.4. Maxima will occur on 
this graph wherever P is at such a distance that n is odd, and minima 
will occur whenever n is even. The same variation in amplitude 
will be obtained at a fixed point P on the axis as the radius of the 
aperture is gradually altered. 

For an off-axial point P' (Fig. 6.5), the amplitude may be deter- 
mined by considering the zones and parts of zones effective within 
the aperture. 



Fig. 6.4. Plot of the amplitude of the resultant vibration at P as the distance b
is increased ($A_1$ is the amplitude due to the first zone only)

Fig. 6.5. How the amplitude of an off-axial point P' is related to the zones and
parts of zones within the aperture



6.3 A Spherical Wavefront in a Circular Aperture 

The radiation incident upon the circular aperture is from a point 
source in a homogeneous medium, so the aperture contains a 
spherical wavefront. This spherical surface may be divided into 
Fresnel zones in the same way as for the plane wavefront (Section 
6.2) and the amplitude at an axial point P beyond the aperture 
determined. However, the obliquity factor is more pronounced, as
also is the variation in the distance $p$ from a point Q in the wavefront
as Q moves from the pole of the wavefront at O to the extremity
E of the aperture (Fig. 6.6). The results obtained from Fresnel's
theory are thus only strictly applicable to small apertures.

To find the area of a half-period zone in a spherical wavefront,
let AB be such a zone (Fig. 6.7), where S is the point source of
radiation at distance $a$ from the wavefront and the point P is at
distance $b$ beyond the wavefront. Let $x$ be the distance along the
axis of symmetry SP to the extremity of the zone through A, and
$x+\delta x$ this distance to the extremity of the zone through B. Let
$\angle BSO = \phi$, where O is the pole of the wavefront, and
$\angle BSA = \delta\phi$.

Fig. 6.6. A spherical wavefront in a circular aperture (the perimeter of the nth zone is marked)

Fig. 6.7. Area of a half-period zone in a spherical wavefront

The area $\delta S$ of the zone AB is given by

\[ \delta S = a\,\delta\phi \cdot 2\pi a \sin\phi = 2\pi a^2 \sin\phi\,\delta\phi \]

Also,

\[ BP^2 = p^2 = a^2 + (a+b)^2 - 2a(a+b)\cos\phi \]

Therefore

\[ p\,\delta p = a(a+b)\sin\phi\,\delta\phi \]

Thus

\[ \delta S = \frac{2\pi a\,p\,\delta p}{a+b} \]

If AB is the $n$th Fresnel zone from the centre, $p = b + (n\lambda/2)$
and $\delta p = \lambda/2$. Therefore

\[ \delta S = \frac{2\pi a}{a+b}\left(b + \frac{n\lambda}{2}\right)\frac{\lambda}{2} \]

If terms in $\lambda^2$ are neglected, then

\[ \delta S = \frac{\pi a b \lambda}{a+b} \tag{6.6} \]

This reduces to the expression for the area of a zone in a plane
wavefront (Equation 6.1) when $a \to \infty$ because, as $a \to \infty$,

\[ \delta S = \frac{\pi b \lambda}{1 + (b/a)} \to \pi b \lambda \]

This would be expected, as a spherical wavefront becomes a plane
one when its radius is infinitely long.
The radius $r_n$ of the $n$th Fresnel zone is given by

\[ r_n^2 = p^2 - b^2 \]

where $p = b + (n\lambda/2)$. Therefore

\[ r_n^2 = nb\lambda + \frac{n^2\lambda^2}{4} \]

If $n$ is not too large, the term involving $\lambda^2$ may be neglected.
As $b$ and $\lambda$ are constants

\[ r_n \propto \sqrt{n} \tag{6.7} \]

For a particular position of the axial point P beyond the wavefront,
it is seen that the radii of the successive Fresnel zones are
proportional to $\sqrt{n}$.
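The square-root law of Equation 6.7 is easily tabulated. The following sketch assumes the plane-wavefront relation $r_n^2 = nb\lambda + n^2\lambda^2/4$ and uses illustrative values of $b$ and $\lambda$ that are not taken from the text.

```python
import math

def zone_radius(n, b, lam):
    """Radius of the nth zone perimeter, with p = b + n*lam/2 and r**2 = p**2 - b**2."""
    return math.sqrt(n * b * lam + (n * lam / 2) ** 2)

if __name__ == "__main__":
    b, lam = 1.0, 500e-9  # assumed: P at 1 m, wavelength 500 nm
    r1 = zone_radius(1, b, lam)
    for n in (1, 2, 3, 4, 9):
        # The last column should be close to sqrt(n) if Equation 6.7 holds.
        print(n, zone_radius(n, b, lam), zone_radius(n, b, lam) / r1)
```

For these values the first zone has a radius of about 0.7 mm, and the ratios in the last column differ from $\sqrt{n}$ by less than one part in a million.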



6.4 The Zone Plate 

As the effects of adjacent Fresnel zones at an axial point P are in 
anti-phase, the resultant amplitude at P is due to the effect of half 
the first and half the last zones only. If, however, alternate zones 
are rendered ineffective, the amplitude at P will result from half 
the total number of zones and will consequently be greatly in- 
creased. 

The zone plate is a transparent plane sheet of glass on which
alternate Fresnel zones are made opaque. A simple method of
making such a zone plate is first to draw in black ink on a white
card a set of concentric rings having radii proportional to the
square roots of the natural numbers 1, 2, 3, ... (Equation 6.7). Every
other zone between rings is made black. This card is photographed.
The glass plate negative contains about 30 transparent zones which
alternate with black opaque ones in a diameter of about 1 cm
(Fig. 6.8).

Fig. 6.8. A zone plate (about six times usual size)

This zone plate has a focusing action rather like that of a
converging lens. Thus suppose a zone plate Z is set up normal to an
axis between a point source S and a point P, where SZ $= a$ and
ZP $= b$ (Fig. 6.9). Suppose the radius of the first transparent
circular region of the zone plate is $r$. Let point P be at such a
position that there is an odd number of zones $(2k+1)$ effective at P
from this first transparent region, where $k$ is an integer. This odd
number will give a maximum amplitude at P (Section 6.2). As the
transparent region has an area of $\pi r^2$ and each Fresnel zone within
it has an area $\pi ab\lambda/(a+b)$ (Equation 6.6), it follows that

\[ \frac{\pi r^2}{\pi ab\lambda/(a+b)} = 2k+1 \]

Fig. 6.9. Action of a zone plate (showing the zone plate, the bright image of S and the subsidiary images of S)

Therefore

\[ \frac{1}{a} + \frac{1}{b} = \frac{(2k+1)\lambda}{r^2} \tag{6.8} \]

This equation is similar to that relating the real object and image
distances $x$ and $x'$ for a converging thin lens of focal length $f$,

\[ \frac{1}{x} + \frac{1}{x'} = \frac{1}{f} \tag{6.9} \]

By analogy, $a$ is the object distance and $b$ the image distance. As all
the alternate transparent regions in the zone plate act like the first
one and all give amplitudes at P in phase, there will be a bright
image at P. The focal length of the zone plate is seen by comparison
of Equation 6.8 and Equation 6.9 to be given by

\[ f = \frac{r^2}{(2k+1)\lambda} \]

There will therefore be a set of focal lengths corresponding to
various values of the integer $k$ and for a given wavelength $\lambda$.
Correspondingly, for a given object distance $a$, there will be a set
of image distances $b$.
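The set of focal lengths and image distances can be illustrated with a short calculation based on Equations 6.8 and 6.9. The function names and the numerical values (a first transparent region of radius 0.5 mm, light of wavelength 500 nm, a source 1 m from the plate) are assumptions chosen only for the example.

```python
def zone_plate_focal_length(r, lam, k=0):
    """f = r**2 / ((2k + 1) * lam), with r the radius of the first transparent region."""
    return r ** 2 / ((2 * k + 1) * lam)

def zone_plate_image_distance(a, r, lam, k=0):
    """Solve 1/a + 1/b = 1/f (Equations 6.8 and 6.9) for the image distance b."""
    f = zone_plate_focal_length(r, lam, k)
    return 1.0 / (1.0 / f - 1.0 / a)

if __name__ == "__main__":
    r, lam, a = 0.5e-3, 500e-9, 1.0  # assumed values in metres
    for k in range(4):
        f = zone_plate_focal_length(r, lam, k)
        b = zone_plate_image_distance(a, r, lam, k)
        print(k, f, b)
```

With these values the longest focal length ($k = 0$) is 0.5 m, giving an image 1 m from the plate, and the focal lengths for $k = 1, 2, 3$ are one third, one fifth and one seventh of this.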

If the zone plate were not present, the amplitude at point P due
to the unimpeded wavefront would be $A_1/2$, half that due to the
first Fresnel zone alone. With the Fresnel zone plate present,
suppose P is at such a position that a small odd number of Fresnel zones
exist in the first transparent region, i.e. $2k+1$ is small. From Section
6.2, it is seen that the amplitude at P due to this first region only is
$A_1 + A_n \approx 2A_1$. But there are several such regions in the zone plate,
all producing the same additive effects at P because the anti-phase
regions are opaque. If this number of transparent zone plate
regions is 10, the amplitude at P will be $20A_1$. The intensity at P is
therefore 400 times that which would be obtained if the zone plate
were absent.

The intensity at P will be greatest when the transparent regions of 
the zone plate each transmit three Fresnel zones. This will corre- 
spond to a primary focal length of the zone plate. There will be short- 
er focal lengths corresponding to five, seven, nine, . . . Fresnel zones 
within each transparent region at which subsidiary maxima in inten- 
sity will occur. Between maxima, much smaller disturbances will 
exist corresponding to the situation where each transparent region 
of the plate contains an even number of Fresnel zones (Section 6.2). 

An even more efficient zone plate has been made in which the
non-active regions of the plate are replaced by transparent crystalline
deposits which introduce a phase change of $\pi$. All the regions of the
plate are now operative in phase. Ideally, such a plate will give four
times the intensity due to that of the conventional zone plate.



6.5 The Cylindrical Wavefront; The Fresnel Integrals 

The treatment of a cylindrical wavefront by Fresnel zone methods is 
more complicated than for plane and spherical wavefronts. This is 
because the Fresnel half-period zones are parallel strips not having 
circular symmetry about an axis as do the annular zones employed 
in the analysis for plane and spherical wavefronts. 

A slit source S of monochromatic radiation of wavelength $\lambda$
produces a cylindrical wavefront at a distance $a$ in a rectangular
aperture which can be divided into half-period zone strips as shown
in Fig. 6.10; the outer extremities of these zones are successively at
distances $b + (\lambda/2)$, $b + \lambda$, $b + (3\lambda/2)$, ... from the axial point P,
which is at a distance $b$ on the axis SP beyond the pole of the
wavefront at O.

Fig. 6.10. Cylindrical wavefront in a rectangular aperture



Consider a strip AB on the cylindrical wavefront at a distance $q$
along the arc from O, where the width of the strip is $\delta q$ (Fig. 6.11).
Let $\angle OSA = \phi$, $\angle ASB = \delta\phi$ and A be at a distance $p$ from the
axial point P beyond the wavefront about O.




Fig. 6.11. A cylindrical wavefront from a slit source 

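The magnitude $\delta U$ of the vibration at point P at time $t$ due to the
strip AB is given by

\[ \delta U = k\,\delta q \sin 2\pi\left(\frac{t}{T} - \frac{p}{\lambda}\right) \tag{6.10} \]

where $T$ is the period of the propagated wave motion and $k$ is a
constant of proportionality, and provided that the angle $\phi$ is small so
that the vibration at the wavefront is proportional to $\sin(2\pi t/T)$.
Putting $p - b = \Delta$, Equation 6.10 then becomes

\[ \delta U = k\,\delta q \sin 2\pi\left(\frac{t}{T} - \frac{b}{\lambda} - \frac{\Delta}{\lambda}\right) \]

\[ \phantom{\delta U} = k\left[\sin 2\pi\left(\frac{t}{T} - \frac{b}{\lambda}\right)\cos\frac{2\pi\Delta}{\lambda} - \cos 2\pi\left(\frac{t}{T} - \frac{b}{\lambda}\right)\sin\frac{2\pi\Delta}{\lambda}\right]\delta q \]

If the magnitude of the vibration at P due to the wavefront
between limits $q_1$ and $q_2$ (provided that $\phi$ is small) is $U$,

\[ U = k\left[\sin 2\pi\left(\frac{t}{T} - \frac{b}{\lambda}\right)\int_{q_1}^{q_2}\cos\frac{2\pi\Delta}{\lambda}\,dq - \cos 2\pi\left(\frac{t}{T} - \frac{b}{\lambda}\right)\int_{q_1}^{q_2}\sin\frac{2\pi\Delta}{\lambda}\,dq\right] \]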
For convenience, put

\[ I_1 = \int_{q_1}^{q_2}\cos\frac{2\pi\Delta}{\lambda}\,dq = A\cos\alpha \tag{6.11} \]

and

\[ I_2 = \int_{q_1}^{q_2}\sin\frac{2\pi\Delta}{\lambda}\,dq = A\sin\alpha \tag{6.12} \]

Then

\[ U = kA\sin\left[2\pi\left(\frac{t}{T} - \frac{b}{\lambda}\right) - \alpha\right] \tag{6.13} \]

where $A^2 = I_1^2 + I_2^2$, $\tan\alpha = I_2/I_1$ and $A$ is the amplitude of the
vibration at P due to the cylindrical wavefront between $q_1$ and $q_2$.
The intensity at P is proportional to $A^2$ and is hence proportional
to $I_1^2 + I_2^2$. The terms $I_1$ and $I_2$ as expressed by Equations 6.11 and
6.12 are known as the Fresnel integrals.
From Fig. 6.11,

\[ p^2 = a^2 + (a+b)^2 - 2a(a+b)\cos\phi \]

or, as $\phi$ is assumed to be small,

\[ p^2 = a^2 + (a+b)^2 - 2a(a+b)\left(1 - \frac{\phi^2}{2}\right) = b^2 + \frac{(a+b)\,q^2}{a} \]

since $q = a\phi$. Therefore

\[ p^2 - b^2 = \frac{(a+b)\,q^2}{a} \tag{6.14} \]

Putting $p + b = 2b$, on the basis that $q$ is small, and $p - b = \Delta$,
Equation 6.14 becomes

\[ 2b\Delta = \frac{(a+b)\,q^2}{a} \]

Therefore

\[ \Delta = \frac{(a+b)\,q^2}{2ab} \]

It is convenient to put

\[ \frac{2\pi\Delta}{\lambda} = \frac{\pi v^2}{2} \]

Therefore

\[ \Delta = \frac{a+b}{2ab}\,q^2 = \frac{\lambda v^2}{4} \tag{6.16} \]

For given values of $\lambda$, $a$ and $b$, it follows on differentiating with
respect to $q$ that

\[ 2q \propto 2v\,\frac{dv}{dq} \]

or

\[ q\,\delta q \propto v\,\delta v \]

The integrals $I_1$ and $I_2$ (Equations 6.11 and 6.12) may therefore be
written in the forms

\[ I_1 = \int\cos\frac{\pi v^2}{2}\,dv \tag{6.17} \]

\[ I_2 = \int\sin\frac{\pi v^2}{2}\,dv \tag{6.18} \]

and

\[ A^2 \propto I_1^2 + I_2^2 \tag{6.19} \]

where now the integrals in Equations 6.17 and 6.18 are standard
forms and may be evaluated from mathematical tables for any
range of $v$, corresponding to a range of $q$ along the wavefront
as given by Equation 6.16.

The results obtained may be regarded as approximately true for 
larger rectangular apertures. 
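In place of mathematical tables, the integrals of Equations 6.17 and 6.18 may be evaluated numerically. The sketch below uses Simpson's rule with the lower limit taken at $v = 0$; the function names, the choice of Simpson's rule and the number of steps are assumptions made for illustration.

```python
import math

def fresnel_integrals(v, steps=2000):
    """Return the integrals of cos(pi*t**2/2) and sin(pi*t**2/2) from 0 to v.

    Composite Simpson's rule; 'steps' is forced to be even.
    """
    if steps % 2:
        steps += 1
    h = v / steps
    cos_sum = sin_sum = 0.0
    for i in range(steps + 1):
        t = i * h
        weight = 1 if i in (0, steps) else (4 if i % 2 else 2)
        cos_sum += weight * math.cos(math.pi * t * t / 2)
        sin_sum += weight * math.sin(math.pi * t * t / 2)
    return cos_sum * h / 3, sin_sum * h / 3

def relative_intensity(v1, v2):
    """Quantity proportional to the intensity for the range v1 to v2 (Equation 6.19)."""
    c1, s1 = fresnel_integrals(v1)
    c2, s2 = fresnel_integrals(v2)
    return (c2 - c1) ** 2 + (s2 - s1) ** 2

if __name__ == "__main__":
    for v in (0.5, 1.0, 1.5, 2.0):
        print(v, fresnel_integrals(v))
```

The value returned by `relative_intensity` is $I_1^2 + I_2^2$ for the chosen range, so it gives the intensity only up to the constant of proportionality in Equation 6.19.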



6.6 The Cornu Spiral 

The effect of the Fresnel zones, subject to the limitations already
mentioned, may be obtained from vector plots based on the Fresnel
integrals. A curve is plotted on Cartesian axes defined by the
parametric equations

\[ x = I_1 \quad\text{and}\quad y = I_2 \tag{6.20} \]

On this curve, an element of length $\delta s$ at any point is related to $\delta x$
and $\delta y$ by the equation

\[ (\delta s)^2 = (\delta x)^2 + (\delta y)^2 \]

But $x = I_1$ is given by Equation 6.17, and $y = I_2$ is given by
Equation 6.18. Therefore,

\[ (\delta s)^2 = \left(\cos^2\frac{\pi v^2}{2} + \sin^2\frac{\pi v^2}{2}\right)(\delta v)^2 = (\delta v)^2 \]

Therefore

\[ s = v \]

The curve plotted is therefore from the equations

\[ x = I_1 = \int_0^s \cos\frac{\pi s^2}{2}\,ds \tag{6.21} \]

and

\[ y = I_2 = \int_0^s \sin\frac{\pi s^2}{2}\,ds \tag{6.22} \]

The intrinsic equation of this curve is readily obtainable because

\[ \frac{dy}{dx} = \tan\frac{\pi s^2}{2} = \tan\psi \]

where $\psi$ is the angle which the tangent to any element $\delta s$ on the
curve makes with the x-axis. Hence

\[ \psi = \frac{\pi s^2}{2} \tag{6.23} \]

is the intrinsic equation required. It represents a spiral, known as
Cornu's spiral (Fig. 6.12), with asymptotic points $L_1$ and $L_2$ having
$(x, y)$ coordinates given by

\[ x = y = \pm\int_0^\infty \cos\frac{\pi s^2}{2}\,ds = \pm\int_0^\infty \sin\frac{\pi s^2}{2}\,ds = \pm\frac{1}{2} \]

On Cornu's spiral, any element of length $\delta s$ is a vector of length
proportional to the contribution $\delta A$ to the amplitude at the point
P (Fig. 6.11) brought about by an elemental zone on the cylindrical
wavefront. The whole spiral is the 'polygon' of infinitesimally short
successive vectors due to all the elemental zones into which the
wavefront may be divided. If, therefore, the wavefront is limited
(for example, by a rectangular aperture) so that the zone vectors
concerned are between points $Q_1$ and $Q_2$ on the Cornu spiral
(Fig. 6.12), the addition of all these vectors gives a resultant decided
by the straight line $Q_1Q_2$. The resultant amplitude of the radiation
at point P beyond the limited wavefront is therefore proportional
to $Q_1Q_2$ and the intensity at P is proportional to $(Q_1Q_2)^2$.

Fig. 6.12. Cornu's spiral



6.7 The Diffraction of a Cylindrical Wave by a Straight Edge 

The cylindrical wavefront at W from the line source S is limited by
a straight edge OC so that the portion of the wavefront to one side
of O is shielded. It is required to determine the nature of the
diffraction pattern in a plane through P perpendicular to the axis SOP
and beyond the wavefront W (Fig. 6.13).

At the point P, only half the wavefront W is effective. The
resultant amplitude of the vibration at P due to the radiation from S is
therefore represented by the straight line $OL_1$ on the Cornu spiral
(Fig. 6.12). For a point such as $P_1$, above P, the pole of the
wavefront W is at $O_1$. Now one half of the wavefront is effective plus the
part $OO_1$ of the other half. The appropriate vector representing the
resultant amplitude at $P_1$ is now the straight line $O_1L_1$ on the Cornu
spiral (Fig. 6.12).

If the point $P_1$ is moved further from P, i.e. the distance $PP_1$ is
increased (Fig. 6.13), the pole $O_1$ moves further along the
unobstructed half of the wavefront W, and correspondingly point $O_1$ on
the Cornu spiral (Fig. 6.12) moves further around the lower spiral
towards the asymptotic point $L_2$. On the other hand, for $P_1$ below
P in Fig. 6.13 (i.e. if this distance is reckoned negative), the
wavefront pole $O_1$ moves to below O, and correspondingly the point $O_1$
on the Cornu spiral moves into the top half towards the asymptotic
point $L_1$.

The edge of the 'geometrical' shadow of the straight edge is at
P on a screen through P perpendicular to the axis SOP. There is
not, however, a clear-cut edge to this shadow because of the
diffraction. The manner in which the intensity of light varies in the shadow
about P is deduced from the variation in position of point $O_1$ on the
Cornu spiral (Fig. 6.12). As $O_1$ moves into the top half of this
spiral, it is clear that the vector $O_1L_1$ diminishes progressively in
length; the intensity, proportional to $(O_1L_1)^2$, will likewise diminish
progressively to become zero when $O_1$ reaches $L_1$. As $O_1$ moves into
the bottom half of the Cornu spiral, it is seen that the intensity
is always greater than that represented by $(OL_1)^2$, increases at first,
and then fluctuates as $O_1$ moves around the turns of the lower spiral
until $L_2$ is reached. Eventually the intensity is represented by $(L_1L_2)^2$,
four times that at the geometrical edge of the shadow at P.

Fig. 6.13. The diffraction of a cylindrical wave by a straight edge (with the plot of intensity across the screen)

This variation in light intensity with position of the point $P_1$ is
shown in Fig. 6.13. Note that within the diffraction bands near the
point P but above it, the intensity maxima are greater than the
intensity well outside the shadow.
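The behaviour described for the straight edge can be reproduced from the chord $O_1L_1$ on the Cornu spiral. In the sketch below, $v$ is the arc length locating $O_1$, taken as positive when $P_1$ lies outside the geometrical shadow, and the intensity is expressed relative to that of the unobstructed wavefront, $(L_1L_2)^2 = 2$. The sign convention, the function names and the use of Simpson's rule are assumptions made for the example.

```python
import math

def fresnel_cs(v, steps=4000):
    """Integrals of cos(pi*t**2/2) and sin(pi*t**2/2) from 0 to v (Simpson's rule)."""
    h = v / steps
    cos_sum = sin_sum = 0.0
    for i in range(steps + 1):
        t = i * h
        weight = 1 if i in (0, steps) else (4 if i % 2 else 2)
        cos_sum += weight * math.cos(math.pi * t * t / 2)
        sin_sum += weight * math.sin(math.pi * t * t / 2)
    return cos_sum * h / 3, sin_sum * h / 3

def straight_edge_intensity(v):
    """Intensity at P1 relative to the unobstructed intensity.

    The chord runs from the asymptotic point at (-1/2, -1/2) to the
    spiral point reached at arc length v.
    """
    c, s = fresnel_cs(v)
    return 0.5 * ((c + 0.5) ** 2 + (s + 0.5) ** 2)

if __name__ == "__main__":
    for v in (-2.0, -1.0, 0.0, 0.5, 1.2, 2.0, 3.0):
        print(v, straight_edge_intensity(v))
```

At $v = 0$ the relative intensity is one quarter, in agreement with the factor of four quoted above, and near $v = 1.2$ it exceeds unity, which corresponds to the first bright band outside the shadow.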



6.8 The Diffraction of a Cylindrical Wave by a Rectangular Aperture 

For a cylindrical wave diffracted by a rectangular aperture, a fixed
length of the arc of the Cornu spiral, decided by the effective section
of the wavefront W in the aperture (Fig. 6.14), moves around the
spiral as the pole $O_1$ on the wavefront moves with the point $P_1$ on
the screen Z. The corresponding vector $Q_1Q_2$ will have a length
depending on the position on the spiral of the arc $Q_1Q_2$. Whereas
the arc length $Q_1Q_2$ is constant, the straight line $Q_1Q_2$ will vary in
length. The magnitude of $(Q_1Q_2)^2$ will be proportional to the
intensity at the point $P_1$ on the screen Z.

Fig. 6.14. Diffraction of a cylindrical wave by a rectangular aperture (arc $Q_1Q_2$
is constant, but vector $Q_1Q_2$ depends on the positions of points $Q_1$ and $Q_2$
on the Cornu spiral)

The diffraction pattern obtained is generally complex and will 
depend on the aperture width. 
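The sliding-arc construction lends itself to a short calculation. In the sketch below the aperture corresponds to a fixed arc length `width_v` on the Cornu spiral, centred at `v_centre`, and the squared chord $Q_1Q_2$ is divided by 2 so that the unobstructed wavefront would give unity. These parameter names, the normalisation and the numerical method are assumptions made for illustration only.

```python
import math

def fresnel_cs(v, steps=4000):
    """Integrals of cos(pi*t**2/2) and sin(pi*t**2/2) from 0 to v (Simpson's rule)."""
    h = v / steps
    cos_sum = sin_sum = 0.0
    for i in range(steps + 1):
        t = i * h
        weight = 1 if i in (0, steps) else (4 if i % 2 else 2)
        cos_sum += weight * math.cos(math.pi * t * t / 2)
        sin_sum += weight * math.sin(math.pi * t * t / 2)
    return cos_sum * h / 3, sin_sum * h / 3

def aperture_intensity(v_centre, width_v):
    """Squared chord Q1Q2 for an arc of fixed length, relative to the unobstructed value."""
    c1, s1 = fresnel_cs(v_centre - width_v / 2)
    c2, s2 = fresnel_cs(v_centre + width_v / 2)
    return 0.5 * ((c2 - c1) ** 2 + (s2 - s1) ** 2)

if __name__ == "__main__":
    width_v = 2.0  # assumed arc length set by the aperture width
    for i in range(-8, 9):
        v_centre = 0.5 * i
        print(v_centre, aperture_intensity(v_centre, width_v))
```

Repeating the loop with different values of `width_v` shows how strongly the form of the pattern depends on the aperture width.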



6.9 The Diffraction of a Cylindrical Wave by a Rectangular 
Obstacle 

If the rectangular aperture (Section 6.8) is replaced by a rectangular
shield obstructing the central part of the wavefront (Fig. 6.15),
only the two outer regions of the wavefront are effective. These
regions will be represented by vectors $Q_1L_1$ and $Q_2L_2$ on the Cornu
spiral, where the positions of $Q_1$ and $Q_2$ depend on the position of
point $P_1$ on the screen. The two separate vectors $Q_1L_1$ and $Q_2L_2$
are added to give a resultant vector of length proportional to the
amplitude at $P_1$.

Fig. 6.15. Diffraction of a cylindrical wave by a rectangular obstacle (the resultant is a vector of length proportional to the amplitude at $P_1$)



6.10 Fraunhofer or Far-field Diffraction 

There is a range of phenomena in which the distances of the object
and image from the aperture or apertures are large compared with
the wavelength of the radiation and the aperture dimensions. This
means that the wavefronts in the aperture are virtually plane, and
parallel bundles of rays are involved. Such diffraction is termed
Fraunhofer, in contrast to the type of diffraction already discussed,
known as Fresnel.

Whilst this division between Fraunhofer and Fresnel diffraction
is generally agreed upon, there is no sharp dividing line
between the two types of diffraction phenomena. In practice, the
dividing line will be determined by the resolving power of the
recording apparatus.



Fig. 6.16. Arrangement for producing far-field diffraction (a point source of monochromatic light at the focal point of lens $L_1$ gives a plane wavefront)

For example, if the pattern in the second focal plane of lens L 2 
(Fig. 6.16) is recorded by some means, the result might be termed 
an ideal Fraunhofer pattern because both object and image are in 
effect at infinity. A number of phenomena occur in this way, and 
no doubt arises. On the other hand, exactly the same pattern may 
be observed as in the position of the focal plane of lens L 2 if no lenses 
are present at all but if the object and image distances from the aper- 
ture are large enough for the recording method to be incapable of 
detecting a difference between Fraunhofer and Fresnel diffraction. 
It is thus simpler to refer to all diffraction effects involving parallel 
diffracted rays as Fraunhofer or far-field effects, and any other as 
Fresnel. This distinction is used in this text. 



6.11 Fraunhofer Diffraction Due to a Single Narrow Slit 

The analysis of Fraunhofer diffraction due to a single narrow slit 
has been given in Section 11.2 of Volume 1 but is repeated here 
in modified form. 

The slit of width $a$ is uniformly illuminated by normally incident
collimated light which is monochromatic of wavelength $\lambda$. The slit
is considered to be divided into equal elements parallel to the length
of the slit and each of width $dx$ (Fig. 6.17). If $R_0$ is the amplitude
of the wave in the plane of the slit, then the amplitude $dR_\theta$ in the
direction s (where the vector s is at angle $\theta$ to the direction of the
incident light) due to the strip of width $dx$ is given by

\[ dR_\theta = R_0 \exp(jkx\sin\theta)\,dx \]

where $k = 2\pi/\lambda$. For the whole slit of width $a$, therefore,

\[ R_\theta = R_0\int_0^a \exp(jkx\sin\theta)\,dx = \frac{R_0}{jk\sin\theta}\left[\exp(jka\sin\theta) - 1\right] \]

Fig. 6.17. Fraunhofer diffraction due to a single narrow slit (incident collimated light of wavelength $\lambda$)

The intensity $I_\theta$ is given by

\[ I_\theta \propto R_\theta R_\theta^* \]

where $R_\theta^*$ is the complex conjugate of $R_\theta$. Therefore

\[ I_\theta \propto \frac{R_0^2\left[\exp(jka\sin\theta) - 1\right]\left[\exp(-jka\sin\theta) - 1\right]}{k^2\sin^2\theta} \]

or

\[ I_\theta \propto \frac{4R_0^2\sin^2[(ka\sin\theta)/2]}{k^2\sin^2\theta} \]

As $a$ is a constant,

\[ I_\theta \propto \frac{R_0^2\sin^2[(ka\sin\theta)/2]}{[(ka\sin\theta)/2]^2} \]

But $I_0 \propto R_0^2$. Therefore, substituting $k = 2\pi/\lambda$,

\[ \frac{I_\theta}{I_0} = \frac{\sin^2[(\pi a\sin\theta)/\lambda]}{[(\pi a\sin\theta)/\lambda]^2} = \frac{\sin^2\alpha}{\alpha^2} \tag{6.24} \]

where $\alpha = (\pi a\sin\theta)/\lambda$. Also

\[ \frac{R_\theta}{R_0} = \frac{\sin\alpha}{\alpha} \]

Fig. 6.18. Vector diagram showing summation of effects due to elements into
which a single slit is divided (see also Fig. 11.2 of Volume 1); the vector OA
represents R, the resultant of an infinite number of vectors along the arc OA,
whose directions run from that of the slit element at $x = 0$ to that of the
slit element at $x = a$

The complex amplitude will involve the phase angle $\phi$, as shown
by the vector diagram of Fig. 6.18 (see also Fig. 11.2 of Volume 1);
as $\phi = \alpha$, the complex amplitude is given by $R_0[(\sin\alpha)/\alpha]\exp j\alpha$.
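Equation 6.24 is simple to evaluate directly. The sketch below tabulates $I_\theta/I_0$ for a slit; the slit width and wavelength are illustrative values, and the function name is an assumption.

```python
import math

def single_slit_relative_intensity(theta, a, lam):
    """Equation 6.24: sin(alpha)**2 / alpha**2 with alpha = pi*a*sin(theta)/lam."""
    alpha = math.pi * a * math.sin(theta) / lam
    if alpha == 0.0:
        return 1.0  # limiting value in the straight-through direction
    return (math.sin(alpha) / alpha) ** 2

if __name__ == "__main__":
    a, lam = 1.0e-4, 500e-9  # assumed: slit 0.1 mm wide, wavelength 500 nm
    first_minimum = math.asin(lam / a)
    for fraction in (0.0, 0.5, 1.0, 1.5, 2.0):
        theta = fraction * first_minimum
        print(fraction, single_slit_relative_intensity(theta, a, lam))
```

The intensity falls to zero where $\sin\theta = \lambda/a$, i.e. where $\alpha = \pi$, and the weak secondary maximum beyond it lies close to $\alpha = 3\pi/2$.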



6.12 Fraunhofer Diffraction Due to a Rectangular Aperture 

If the width of the slit is comparable with its length,
two-dimensional Fraunhofer diffraction by a rectangular aperture is
concerned. Suppose such an aperture has a corner at the origin O of
perpendicular axes Ox and Oy and has sides of length $a$ along the
x-axis and $b$ along the y-axis [Fig. 6.19(a)]. Let $\delta A$ be a small
element of area in the aperture in the plane xOy having a reference
vector r with respect to the origin O. In Fig. 6.19(b), s is a unit
vector in the diffraction direction considered. If the direction
cosines of s relative to the x, y and z axes are respectively $l$, $m$ and $n$,
the path difference between the parallel wavetrains 1 and 2 is r.s
and the corresponding phase difference is $k$r.s, where $k = 2\pi/\lambda$.
Note that the scalar product r.s has a magnitude $rs\cos(r, s)$
where (r, s) is the angle between vectors r and s.

The vector s prolonged to a far plane beyond the rectangular
aperture, where this distant plane is parallel to the aperture plane,
will meet this plane at a point Q. The direction cosines of the
vector joining $dA$ to Q will be $l$ and $m$ relative to the x and y axes
respectively; the term in the direction cosine $n$ makes no
contribution, because every element of the aperture lies in the plane
$z = 0$, to which the z-axis is normal. The complex amplitude at this
distant point Q is $R_{l,m}$ given by

Fig. 6.19. Fraunhofer diffraction at a rectangular aperture: (a) the element $dA$ in the aperture of sides $a$ and $b$; (b) the path difference r.s between rays 1 and 2



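\[ R_{l,m} \propto \int_A \exp(jk\,\mathbf{r}\cdot\mathbf{s})\,dA \]

where $A$ is the area of the aperture. This expression becomes in
Cartesian coordinates

\[ R_{l,m} \propto \int_0^a\!\!\int_0^b \exp[jk(lx + my + nz)]\,dx\,dy \]

\[ \propto \int_0^a \left[\frac{\exp[jk(lx + my)]}{jkm}\right]_0^b dx \]

\[ \propto \int_0^a \frac{\exp(jklx)}{jkm}\left[\exp(jkmb) - 1\right]dx \]

\[ \propto \left[\frac{\exp(jklx)}{-k^2ml}\right]_0^a \left[\exp(jkmb) - 1\right] \]

\[ \propto -\frac{\left[\exp(jkla) - 1\right]\left[\exp(jkmb) - 1\right]}{k^2ml} \]

Therefore

\[ |R_{l,m}|^2 = R_{l,m}R_{l,m}^* \propto \frac{\sin^2\alpha_1\sin^2\alpha_2}{(k^2ml)^2} \]

where $\alpha_1 = \pi la/\lambda$ and $\alpha_2 = \pi mb/\lambda$. Therefore, as $a$, $b$ and $\lambda$ are all
constants,

\[ |R_{l,m}|^2 \propto \frac{\sin^2\alpha_1\sin^2\alpha_2}{\alpha_1^2\alpha_2^2} \tag{6.25} \]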



For a maximum intensity in the diffraction pattern,

\[ \alpha_1 = (2p+1)\frac{\pi}{2} \]

and

\[ \alpha_2 = (2q+1)\frac{\pi}{2} \]

where $p$ and $q$ are integers. Thus

\[ la = (2p+1)\frac{\lambda}{2} \tag{6.26} \]

and

\[ mb = (2q+1)\frac{\lambda}{2} \tag{6.27} \]



Considering diffracted rays from the corners O, A and B of the
aperture (Fig. 6.20), it is seen from Equations 6.26 and 6.27 that
the condition for a maximum in the direction s, apart from the
central maximum, is given by

\[ \mathbf{a}\cdot\mathbf{s} = (2p+1)\frac{\lambda}{2} \]

and

\[ \mathbf{b}\cdot\mathbf{s} = (2q+1)\frac{\lambda}{2} \]

because $l = \cos\psi_a$, $m = \cos\psi_b$ and s is a unit vector.
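Equation 6.25 can be evaluated for any pair of direction cosines. The sketch below normalises the pattern to unity at the central maximum and evaluates it at one of the directions given by Equation 6.26; the aperture dimensions, wavelength and function names are assumed for illustration.

```python
import math

def sinc_squared(alpha):
    """sin(alpha)**2 / alpha**2, with the limiting value 1 at alpha = 0."""
    if alpha == 0.0:
        return 1.0
    return (math.sin(alpha) / alpha) ** 2

def rectangular_aperture_intensity(l, m, a, b, lam):
    """Equation 6.25, normalised to 1 at l = m = 0.

    alpha_1 = pi*l*a/lam and alpha_2 = pi*m*b/lam.
    """
    alpha_1 = math.pi * l * a / lam
    alpha_2 = math.pi * m * b / lam
    return sinc_squared(alpha_1) * sinc_squared(alpha_2)

if __name__ == "__main__":
    a, b, lam = 2.0e-4, 1.0e-4, 500e-9  # assumed aperture sides and wavelength
    l_side = 3 * lam / (2 * a)           # Equation 6.26 with p = 1
    print(rectangular_aperture_intensity(0.0, 0.0, a, b, lam))
    print(rectangular_aperture_intensity(l_side, 0.0, a, b, lam))
    print(rectangular_aperture_intensity(lam / a, 0.0, a, b, lam))
```

In the direction given by Equation 6.26 with $p = 1$ the intensity is $4/9\pi^2$, about 4.5 per cent of the central value. The true subsidiary maximum lies close to, though not exactly at, this direction, because the denominator $\alpha_1^2$ also varies.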




Fig. 6.20. Diffracted rays from a rectangular aperture 




6.13 Fraunhofer Diffraction Due to a Circular Aperture 

The circular aperture centre O is illuminated uniformly by
monochromatic light of wavelength $\lambda$ incident along the axis Oz normal
to the aperture. The Fraunhofer amplitude $R_s$ in the direction of
the unit vector s [Fig. 6.21(a)] is given by

\[ R_s = \int_A R_0 \exp(jk\,\mathbf{r}\cdot\mathbf{s})\,dA \]

where $dA$ is an element of area about point P in the aperture,
$k = 2\pi/\lambda$, OP is a vector of length $r$ in the plane of the aperture,
$R_0$ is the amplitude of the wave in the plane of the aperture, and $A$
is the area of the aperture.



Fig. 6.21. Fraunhofer diffraction at a circular aperture: (a) circular aperture in a shield in the xOy plane, with collimated incident light of wavelength $\lambda$; (b) plot of $R_s/R_0$ against $\phi$

Choosing cylindrical coordinates so that point P has coordinates
$(r, \phi, 0)$ with respect to the centre O of the aperture,

\[ R_s = \int_0^{2\pi}\!\!\int_0^a R_0 \exp(jk\,\mathbf{r}\cdot\mathbf{s})\,r\,d\phi\,dr \]

The solution of this integral is of the form

\[ R_s \propto \frac{J_1(\phi)}{\phi} \]

where $J_1(\phi)$ is the first order Bessel function and $\phi = (2\pi a\sin\theta)/\lambda$.
A plot of $R_s/R_0$ against $\phi$ is shown in Fig. 6.21(b). Note that
$R_s$ is zero at an angle $\phi$ given by $1.22\pi$.
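The position of the first zero can be confirmed numerically. The sketch below evaluates $J_1$ from Bessel's integral, $J_1(x) = (1/\pi)\int_0^\pi \cos(\tau - x\sin\tau)\,d\tau$, by the trapezium rule and forms $2J_1(\phi)/\phi$, which is the amplitude normalised to unity in the forward direction. The normalisation, the function names and the numerical method are assumptions made for the example.

```python
import math

def bessel_j1(x, steps=200):
    """First order Bessel function from Bessel's integral (trapezium rule)."""
    h = math.pi / steps
    # End-point values of cos(tau - x*sin(tau)) at tau = 0 and tau = pi.
    total = 0.5 * (math.cos(0.0) + math.cos(math.pi - x * math.sin(math.pi)))
    for i in range(1, steps):
        tau = i * h
        total += math.cos(tau - x * math.sin(tau))
    return total * h / math.pi

def circular_aperture_amplitude(phi):
    """2*J1(phi)/phi, i.e. the amplitude relative to its value at phi = 0."""
    if phi == 0.0:
        return 1.0
    return 2.0 * bessel_j1(phi) / phi

if __name__ == "__main__":
    for multiple in (0.0, 0.5, 1.0, 1.22, 1.5, 2.0):
        phi = multiple * math.pi
        print(multiple, circular_aperture_amplitude(phi))
```

The printed amplitude changes sign between $\phi = \pi$ and $\phi = 1.5\pi$ and is very nearly zero at $\phi = 1.22\pi$.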

6.14 The Babinet Principle of Complementary Screens 

Suppose a screen $D_1$ contains any distribution of opaque and
transparent portions (for example, $D_1$ may be a flat opaque sheet
containing a number of holes). The screen $D_2$ complementary to $D_1$
has the opaque portions where $D_1$ has transparent ones and vice
versa. Thus an example of $D_1$ as a flat opaque sheet containing a
series of holes would have complementary screen $D_2$ in the form
of a transparent glass plate containing a like series of opaque
patches where there were holes in $D_1$.

Let Q be a point in a field viewed by an observer. A source of
radiation exists which illuminates this observed field, but the
disturbance due to this radiation at Q is zero. For example, Q may
be just outside the direct light produced by a luminous source.

The diffracting screen or shield $D_1$ is introduced between the
source of radiation and the field of view. As a consequence, a
disturbance of amplitude $R_1$ is produced at Q; point Q becomes
illuminated because of diffraction due to $D_1$, even though with $D_1$
absent there is no illumination at Q. Babinet's principle states that
the intensity at Q due to the introduction of $D_1$ is the same as
that which would be produced by the introduction, instead, of the
complementary screen $D_2$.

This principle is easily proved. Thus, suppose the disturbance
at Q due to the introduction of diffracting screen $D_2$ has an
amplitude $R_2$. Any radiation received at Q must be through the
transparent portions of the screen. If the transparent portions of both
screens $D_1$ and $D_2$ were imagined to be present simultaneously,
the radiation would be unimpeded and the disturbance at Q would
be zero. It follows that $R_1 = -R_2$ and $R_1^2 = R_2^2$. Complementary
screens therefore produce identical diffraction patterns in those
regions of the field of view outside the direct light from the source.
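A one-dimensional, discrete analogue of this argument is sketched below. The far-field amplitude of a sampled screen is modelled, as an assumption, by its discrete Fourier transform; the sample `k = 0` stands for the direct light, and every other `k` for a direction outside it. The screen pattern and the function names are hypothetical.

```python
import cmath

def far_field(transmission):
    """Discrete Fourier transform of a sampled transmission function."""
    n = len(transmission)
    return [
        sum(transmission[x] * cmath.exp(-2j * cmath.pi * k * x / n) for x in range(n))
        for k in range(n)
    ]

def complementary(transmission):
    """Opaque (0) where the original is transparent (1), and vice versa."""
    return [1 - t for t in transmission]

if __name__ == "__main__":
    # Hypothetical screen D1: opaque except for two slits of unequal width.
    d1 = [1 if (10 <= x < 14 or 30 <= x < 33) else 0 for x in range(64)]
    d2 = complementary(d1)
    r1, r2 = far_field(d1), far_field(d2)
    for k in (1, 2, 3, 10):
        print(k, abs(r1[k]) ** 2, abs(r2[k]) ** 2, abs(r1[k] + r2[k]))
```

For every `k` other than zero the two amplitudes cancel, so the two intensities printed on each line are equal, which is the statement $R_1 = -R_2$ and $R_1^2 = R_2^2$ in discrete form.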






6.15 Kirchhoff's Formulation of Huygens' Principle 

As stated at the beginning of this chapter, the Fraunhofer and the 
Fresnel elementary treatments of diffraction are both approximate, 
because they do not take into account changes of phase on re- 
radiation from a Huygens' secondary source and give no exact 
analysis of obliquity. The Kirchhoff treatment accounts for both 
these factors, but ignores polarisation ; it is therefore only strictly 
true for longitudinal waves or for electric vectors all acting in the 
same directions. Furthermore, no account is taken of diffraction 
effects at a field of view very close to the diffracting obstacle. The 
Kirchhoff analysis is thus not a full one and may be considered as a 
mathematical formulation of ideas which were already held by 
Fresnel. 

The problem is to find the disturbance at a point Q in terms of 
the integrated disturbances over the wavefront W arising from a 
source of radiation S [Fig. 6.22(a)]. The analysis involves the use 





Fig. 6.22. Kirchhoff's theory of diffraction

of Green's theorem, expressed in the form:

\[ \int_A (u\,\mathrm{grad}\,v - v\,\mathrm{grad}\,u)\cdot d\mathbf{A} = \int_\tau (u\,\nabla^2 v - v\,\nabla^2 u)\,d\tau \tag{6.28} \]

Here, $u$ and $v$ are two scalar quantities assumed to be single-valued
and continuous in a given volume $\tau$, where $d\tau$ is an element of
volume within $\tau$. The element of area $dA$ is on the bounding surface
of the total area $A$ enclosing the volume $\tau$, and the vector $d\mathbf{A} = \mathbf{n}\,dA$,
where n is a unit vector drawn in the outward normal direction
at $dA$.

It is assumed then that the wave surface W completely surrounds
the point Q and that the source S is outside it [Fig. 6.22(b)]. The
functions $u$ and $v$ are chosen as spherical wave amplitudes
satisfying the wave equation as follows:

\[ u = \frac{\exp(-jkr_s)}{r_s} \tag{6.29} \]

and

\[ v = \frac{\exp(-jkr)}{r} \tag{6.30} \]

where $k = 2\pi/\lambda$, the distance $r$ is measured from point Q to any
point in the surface enclosing the volume, and the distance $r_s$ is
measured from point S to any point in the enclosing surface.
Since $v \to \infty$ as $r \to 0$, which is inadmissible, the point Q must
be excluded from the region of integration. To do this, Q is
surrounded by a sphere of very small radius; over the surface of this
sphere, the amplitude of the disturbance is assumed to be constant
and equal to the amplitude $u_q$ at Q which it is required to find.
As there are no sources of radiation within the volume $\tau$,
Poisson's equation gives

\[ \nabla^2 u = \nabla^2 v = 0 \tag{6.31} \]

Equation 6.28 therefore becomes simply 

J" (w grad v—v grad u).dA =0 (6.32) 

A 
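As a supplementary check on this step (a sketch, not part of the original argument): the time-independent spherical-wave amplitudes of Equations 6.29 and 6.30 strictly satisfy the Helmholtz form of the wave equation rather than Laplace's equation, so each Laplacian is not itself zero. The volume integrand of Equation 6.28 still vanishes identically, because the two terms cancel, and Equation 6.32 follows unchanged:

```latex
\nabla^{2}u = -k^{2}u, \qquad \nabla^{2}v = -k^{2}v
\quad\Longrightarrow\quad
u\,\nabla^{2}v - v\,\nabla^{2}u = -k^{2}uv + k^{2}uv = 0 .
```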

Consider first the value of this integral when the enclosing surface
W is a sphere of radius r around point Q. The element of
area dA is then $r^2\,d\omega$, where $d\omega$ is the solid angle subtended by
dA at Q. Then,

$$I = \int_{\text{sphere}} (u\,\operatorname{grad} v - v\,\operatorname{grad} u)\cdot d\mathbf{A} = \int_{\text{sphere}} \left[u\,\frac{\partial v}{\partial r}(-\mathbf{n}) - v\,\operatorname{grad} u\right]\cdot \mathbf{n}\,r^2\,d\omega \tag{6.33}$$

because $\mathbf{n}$ and $\mathbf{r}$ are in opposite senses so that the unit vector along
grad v is $-\mathbf{n}$. Differentiating Equation 6.30 with respect to r gives

$$\frac{\partial v}{\partial r} = \frac{-jkr\exp(-jkr) - \exp(-jkr)}{r^2}$$

Substitution in Equation 6.33 gives

$$I = \int_{\text{sphere}} \left[r\exp(-jkr)\,(jku - \operatorname{grad} u\cdot\mathbf{n}) + u\exp(-jkr)\right] d\omega$$

For the sphere of very small radius around point Q over the
surface of which $u = u_q$ is a constant (i.e. $u \to u_q$ as $r \to 0$), the
integral I becomes I' given by

$$I' = \int_{\substack{\text{sphere} \\ r \to 0}} u_q\,d\omega = 4\pi u_q \tag{6.34}$$

The total integral for a wavefront W of any shape enclosing point Q
is therefore given by I' for the small sphere around Q plus the
integral given by Equation 6.32 for the surface W. So the appropriate
expression is

$$4\pi u_q + \int_W (u\,\operatorname{grad} v - v\,\operatorname{grad} u)\cdot d\mathbf{A} = 0$$

Therefore

$$u_q = \frac{1}{4\pi}\int_W (-u\,\operatorname{grad} v + v\,\operatorname{grad} u)\cdot d\mathbf{A} \tag{6.35}$$

or, substituting for u and v from Equations 6.29 and 6.30 respectively,

$$u_q = \frac{1}{4\pi}\int_W \left\{-\frac{\exp(-jkr_s)}{r_s}\,\operatorname{grad}\left[\frac{\exp(-jkr)}{r}\right] + \frac{\exp(-jkr)}{r}\,\operatorname{grad}\left[\frac{\exp(-jkr_s)}{r_s}\right]\right\}\cdot d\mathbf{A} \tag{6.36}$$

This is Kirchhoff's formula giving $u_q$, the amplitude of the disturbance
at point Q in terms of the integrated disturbances over a
closed wave surface W enclosing Q. The function u is the complex
amplitude of the disturbance arriving at W from the source of
radiation S, and the integral is the summation of the wavelets
from W to Q.

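This theory may be applied to optical problems involving transverse
waves when the source S and the point Q both lie on the
axis of a circular aperture in an opaque screen. In this special
situation, the amplitude and phase are both independent of polarisation
and Equation 6.35 is identical for longitudinal and transverse
waves. Equation 6.35 may be written in the form

$$u_q = \frac{1}{4\pi}\int_W \left(v\,\frac{\partial u}{\partial n} - u\,\frac{\partial v}{\partial n}\right) dA \tag{6.37}$$

because

$$\operatorname{grad} u\cdot d\mathbf{A} = \frac{\partial u}{\partial n}\,dA$$

and

$$\operatorname{grad} v\cdot d\mathbf{A} = \frac{\partial v}{\partial n}\,dA$$

Substituting for v from Equation 6.30, therefore,

$$u_q = \frac{1}{4\pi}\int_W \left\{\frac{1}{r}\exp(-jkr)\,\frac{\partial u}{\partial n} - u\,\frac{\partial}{\partial n}\left[\frac{1}{r}\exp(-jkr)\right]\right\} dA$$

i.e.

$$u_q = \frac{1}{4\pi}\int_W \left[\frac{1}{r}\exp(-jkr)\,\frac{\partial u}{\partial n} + \frac{jku\exp(-jkr)}{r}\,\frac{\partial r}{\partial n} - u\exp(-jkr)\,\frac{\partial}{\partial n}\left(\frac{1}{r}\right)\right] dA \tag{6.38}$$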

6.16 Kirchhoff's Theory of the Diffraction of a Spherical Wave by 
a Circular Aperture 

The surface of integration is made the part of the spherical wave- 
front AOB in the circular aperture, the opaque screen containing 
this aperture and an infinitely distant hemisphere (Fig. 6.23). 
The contributions due to the opaque screen and the hemisphere 
are assumed to be zero leaving the wavefront in the aperture only. 
Let the complex amplitude of the vibration in the circular aperture
be

$$u_0 = \frac{\exp(-jkr_s)}{r_s}$$

where $r_s$ is the distance from the point source S to the spherical
wavefront in the aperture.

For an element of area dA in this wavefront, let the distance to an
axial point P beyond the aperture be r. The centre of the aperture
is at O and the axial distance OP = $r_p$.

Fig. 6.23. Diffraction of a spherical wave by a circular aperture



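The normal $\mathbf{n}$ to the wavefront drawn outwards from the closed
surface is towards the source. If the angle between this normal
and the radius vector $\mathbf{r}$ is $\theta$, then

$$\frac{\partial r}{\partial n} = \cos\theta$$

Substituting in Equation 6.38, but for $u_p$ (the complex amplitude
of the disturbance at point P),

$$u_p = \frac{1}{4\pi}\int_{\text{aperture}} \left\{\frac{\exp(-jkr)}{r}\,\frac{\partial}{\partial n}\left[\frac{\exp(-jkr_s)}{r_s}\right] + \frac{jku_0\exp(-jkr)}{r}\cos\theta - u_0\exp(-jkr)\,\frac{\partial}{\partial n}\left(\frac{1}{r}\right)\right\} dA$$

In this expression

$$\frac{\partial}{\partial n}\left[\frac{\exp(-jkr_s)}{r_s}\right] = \frac{-jkr_s\exp(-jkr_s) - \exp(-jkr_s)}{r_s^2}\,\frac{\partial r_s}{\partial n}$$

As

$$u_0 = \frac{\exp(-jkr_s)}{r_s}$$

and, because $r_s$ decreases along the outward normal (which points towards the source),

$$\frac{\partial r_s}{\partial n} = -1$$

therefore

$$\frac{\partial}{\partial n}\left[\frac{\exp(-jkr_s)}{r_s}\right] = jku_0 + \frac{u_0}{r_s}$$

Thus

$$u_p = \frac{1}{4\pi}\int_{\text{aperture}} \left[\frac{\exp(-jkr)}{r}\left(jku_0 + \frac{u_0}{r_s}\right) + \frac{jku_0\exp(-jkr)}{r}\cos\theta - u_0\exp(-jkr)\,\frac{\partial}{\partial n}\left(\frac{1}{r}\right)\right] dA$$

As

$$\frac{\partial r}{\partial n} = \cos\theta$$

then

$$u_p = \frac{u_0}{4\pi}\int_{\text{aperture}} \left[\frac{jk\exp(-jkr)(1+\cos\theta)}{r} + \frac{\exp(-jkr)}{rr_s} + \frac{\exp(-jkr)\cos\theta}{r^2}\right] dA \tag{6.39}$$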



The physical interpretation of Equation 6.39 is that there
are, in effect, three wavelets emanating from a Huygens' secondary
source:

1. A wavelet depending on the area dA, $1/r$ and an obliquity
factor $1 + \cos\theta$, in interpretation of the term
$[jk\exp(-jkr)(1+\cos\theta)]/r$;

2. A wavelet depending on the area dA and $1/r$ but with no
obliquity, decided by the term $[\exp(-jkr)]/rr_s$, where $r_s$ is
the separation between source and aperture;

3. A wavelet depending on the area dA and varying inversely
with $r^2$ and with an obliquity factor $\cos\theta$, given by the term
$[\exp(-jkr)\cos\theta]/r^2$.

The wavelets in the first and third of these cases, in which obliquity
factors are involved, are illustrated in Fig. 6.24.

Note also that the term $[jk\exp(-jkr)(1+\cos\theta)]/r$ involves a
phase change of $\pi/2$ on radiation from the secondary source. This
change in phase on radiation from a point source has been observed
in microwave experiments involving interference between plane
and spherical waves. It was first noted by Gouy in 1890 using interference
in the visible region.

In the simple Fresnel treatment, r and $r_s$ are both large compared
with the wavelength, and the obliquity angle $\theta$ is small. Hence
only the term $[jk\exp(-jkr)(1+\cos\theta)]/r$ is effective, involving the
phase change of $\pi/2$. The obliquity factor with small $\theta$ becomes
$1 + \cos\theta \approx 2$.
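The dominance of the first term can be checked numerically. The sketch below compares the magnitudes of the three bracketed terms of Equation 6.39, taking them in the forms $k(1+\cos\theta)/r$, $1/(rr_s)$ and $\cos\theta/r^2$. The function name and the sample values (green light, distances of 1 m, an obliquity of 0.01 rad) are illustrative assumptions, not values taken from the text.

```python
import math

def kirchhoff_term_magnitudes(wavelength, r, r_s, theta):
    """Magnitudes of the three bracketed terms of Equation 6.39 (hypothetical helper)."""
    k = 2.0 * math.pi / wavelength
    term1 = k * (1.0 + math.cos(theta)) / r   # |jk exp(-jkr)(1 + cos theta)/r|
    term2 = 1.0 / (r * r_s)                   # |exp(-jkr)/(r r_s)|
    term3 = abs(math.cos(theta)) / r ** 2     # |exp(-jkr) cos theta / r^2|
    return term1, term2, term3

# Assumed sample values: wavelength 546 nm, r = r_s = 1 m, theta = 0.01 rad.
t1, t2, t3 = kirchhoff_term_magnitudes(546e-9, 1.0, 1.0, 0.01)
ratio_second = t2 / t1   # relative size of the second term
ratio_third = t3 / t1    # relative size of the third term
obliquity = 1.0 + math.cos(0.01)
print(ratio_second, ratio_third, obliquity)
```

With distances of many wavelengths both ratios are of the order of $1/(kr)$, which is why only the first wavelet matters in the elementary Fresnel treatment.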



Fig. 6.24. Effect of obliquity factors on wavefronts from a point source: (a)
Huygens' source with obliquity factor $1 + \cos\theta$; (b) Huygens' source with
obliquity factor $\cos\theta$



6.17 Interference and Associated Phenomena 

In the previous sections on diffraction, the spatial amplitude 
distribution arising from a given wavefront in a given aperture of 
finite dimensions is considered. In the following sections, the con- 
cern is the spatial amplitude distribution arising from a number of 
separated coherent sources. Two sources of radiation are coherent 
if they send out wavetrains having a phase difference which is 
independent of time. The coherent sources may be a set of apertures 
which are not necessarily identical, and the problem of finding 
the spatial amplitude distribution then involves both diffraction 
and interference. The diffraction problem is solved by determining 
the diffraction effect of each aperture at the point in the field of 
view considered, and the interference problem by considering the 
effect at this point of separated wavetrains from different apertures. 
The final effect is found by considering the two phenomena to- 
gether. A full treatment is beyond the present scope. To limit 
the scope, the important cases discussed are those where identical 
sets of coherent sources in two dimensions are involved and the 
diffraction is of the Fraunhofer type. Interference may also arise 
from a set of coherent scatterers in one, two or three dimensions 
(for example, in X-ray diffraction). 



6.18 Coherence 

Suppose a set of sources of radiation gives rise to wavetrains which
interfere, but that the diffraction effects are uniform in all directions
so need not be considered. Let the complex amplitudes $u_1, u_2, u_3,
\ldots, u_N$ from sources 1, 2, 3, . . ., N be represented by

$$u_1 = R_1\exp(j\phi_1)$$
$$u_2 = R_2\exp(j\phi_2)$$
$$u_3 = R_3\exp(j\phi_3)$$
$$\vdots$$
$$u_N = R_N\exp(j\phi_N)$$

where the time factor is omitted, and all the vibrations are of the
same frequency and assumed to be in the same direction.

By the principle of superposition, these amplitudes may be
added vectorially in the usual way (Fig. 6.25) to give the resultant R.
Clearly,

$$R^2 = \left(\sum_{n=1}^{N} R_n\sin\phi_n\right)^2 + \left(\sum_{n=1}^{N} R_n\cos\phi_n\right)^2 \tag{6.40}$$

or

$$R^2 = \sum_{n=1}^{N} R_n^2 + 2\sum_{n \neq m} R_nR_m(\sin\phi_n\sin\phi_m + \cos\phi_n\cos\phi_m) \tag{6.41}$$

where each pair of different sources n, m is counted once in the second sum.



Assume that the angles $\phi_n$ have random values, so that the phase
angle $\phi_m - \phi_n$ between any two wavetrains n and m also takes on
random values over a period of time. As all values of $\phi_n$ may be
assumed to be equally probable, the mean value $\overline{R^2}$ of $R^2$ over a
finite time is given by

$$\overline{R^2} = \sum_{n=1}^{N} R_n^2$$

because $\sin\phi_n\sin\phi_m$ and $\cos\phi_n\cos\phi_m$ in Equation 6.41 average
out to zero over an integral number of complete cycles. It follows
that if $\phi_n - \phi_m$ has random values, the resulting intensity, proportional
to $\overline{R^2}$, is equal to the sum of the individual intensities. The
effect at any point is therefore simply a photometric scalar effect,
and no interference occurs. Sources such as these are termed
incoherent.

Fig. 6.25. Vector addition of amplitudes to give the resultant amplitude

For coherent sources, $\phi_n - \phi_m$ must be independent of time and
depend only upon the spatial coordinates of the point at which
interference takes place.
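The averaging argument can be illustrated by a small simulation. This is a sketch under stated assumptions: the four amplitudes, the number of trials and the random seed are arbitrary illustrative choices, and the helper names are hypothetical. It evaluates Equation 6.40 for many sets of random phases and compares the mean of $R^2$ with the sum of the $R_n^2$; for contrast it also evaluates the fully coherent case in which all the phases are equal.

```python
import math
import random

def resultant_squared(amplitudes, phases):
    """R^2 from Equation 6.40 for given amplitudes R_n and phases phi_n."""
    s = sum(R * math.sin(p) for R, p in zip(amplitudes, phases))
    c = sum(R * math.cos(p) for R, p in zip(amplitudes, phases))
    return s * s + c * c

def mean_resultant_squared(amplitudes, trials=20000, seed=0):
    """Average R^2 over many independent sets of uniformly random phases."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in amplitudes]
        total += resultant_squared(amplitudes, phases)
    return total / trials

amplitudes = [1.0, 2.0, 0.5, 1.5]                  # assumed sample amplitudes
incoherent = mean_resultant_squared(amplitudes)    # random phases
sum_of_squares = sum(R * R for R in amplitudes)    # sum of individual intensities
coherent = resultant_squared(amplitudes, [0.0] * len(amplitudes))  # equal phases
print(incoherent, sum_of_squares, coherent)
```

The random-phase average settles close to the sum of the individual intensities, whereas equal phases give the square of the summed amplitudes, which is the distinction between incoherent and coherent addition drawn above.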



6.19 The Classification of Interference Phenomena 

In all interference phenomena, two or more wavetrains, which 
interfere spatially, are involved. An obvious means of classification 
is based on the methods used to produce the necessary coherent 
Huygens' sources which give rise to these wavetrains. In all methods 
a single wavefront is used to form two or more secondary sources. 

In division of wavefront methods, a section of an extended source 
is used to give a plane wave having as high a degree of coherence as 
is compatible with reasonable intensity. This plane wavefront is di- 
vided by apertures in a screen which then form the interfering 
sources. A spatial interference pattern is established beyond the 
screen and is modulated by the diffraction effect at the separated 
apertures. The result may be regarded as a 'pure' interference effect 
if the apertures have infinitely small dimensions so that the diffrac- 
tion effects will tend towards uniformity in all directions. An exam- 
ple is the plane transmission diffraction grating, which comprises 
a large number of parallel close thin slits in a plane screen. A plane 
wavefront falling normally on these slit apertures gives rise to inter- 
fering sources. 

In division of amplitude, a given wavetrain is divided into two
wavetrains, one reflected and the other refracted at the boundary
between two optical media. If the wavetrains are superimposed
after travelling through different optical paths, then interference
takes place between them. An example is interference in thin films.
Referring to Fig. 6.26, note that the film must be very thin if the
reflected and refracted wavefronts are to be superposed; indeed the
film thickness must be of the order of the wavelength of the interfering
light.

Fig. 6.26. Interference of light in a thin film (labels: incident beam with plane
wavefront; interfering beams; region of interference for a very thin film)

If the reflecting surfaces A and B in Fig. 6.26 are separated by
a distance appreciably greater than the wavelength of the radiation,
the emergent rays 1 and 2 are separated. Now there is both division
of wavefront and of amplitude.



6.20 The General Forms of Fringes due to Interference in Thin 
Films 

Let A and B (Fig. 6.27) be two reflecting surfaces at a small angle $\alpha$
to one another and enclosing a thin film which, for simplicity, is
assumed to be air of refractive index n = 1. A ray from an extended
source meets the surface A at P and is reflected along PE. The same
incident ray will also traverse the thin film and be reflected from
the surface B to give rise to a ray (not shown in Fig. 6.27) exceedingly
close to PE which will interfere with the reflected ray from A
provided that the angle $\alpha$ and the film thickness are both very small.
An observer at E will thus see a localised interference effect in the
region of P. Let EO be normal to the surface A at point O. Taking
O to be the origin of Cartesian axes Ox and Oy, the coordinates of
point P are (x, y) and OP = $\sqrt{x^2+y^2}$. The path difference $\Delta$ between
the two interfering wavetrains reaching E is given (from
Equation 9.1 of Volume 1) by

$$\Delta = 2t_p\cos\phi \tag{6.42}$$

where $t_p$ is the thickness of the film at point P, and $\phi$ is the angle
of refraction in the film which, for an air film, is the same as the
angle of reflection of the ray PE. This angle to the normal EO is
$\angle PEO = \phi$. As $\angle POE$ is a right angle, it follows that

$$\cos\phi = \frac{h}{EP} = \frac{h}{\sqrt{h^2+x^2+y^2}} \tag{6.43}$$

Fig. 6.27. Interference on reflection from a thin film (the film thickness and
angle $\alpha$ are both greatly exaggerated; EO is normal to the surface A)

If the thickness of the film at O (i.e. along the normal to the film)
is $t_0$,

$$t_p = t_0 + \alpha x \tag{6.44}$$

provided that the y-axis is parallel to the line of intersection of surfaces
A and B and the perpendicular x-axis is in the plane A. Substituting
from Equations 6.43 and 6.44 into Equation 6.42 gives

$$\Delta = \frac{2(t_0+\alpha x)h}{\sqrt{h^2+x^2+y^2}}$$

The locus of a given interference fringe is hence given by

$$x^2(\Delta^2 - 4h^2\alpha^2) + y^2\Delta^2 - 8t_0\alpha h^2x + h^2(\Delta^2 - 4t_0^2) = 0 \tag{6.45}$$

where x and y are the coordinates of points on the locus with respect
to the origin O (where the normal from the eye meets the surface
A), h is the distance of the eye from this origin, $\alpha$ is the small
angle between the reflecting surfaces A and B, $t_0$ is the thickness of
the film at the origin O, and $\Delta$, the path difference, is a constant
for a given fringe.

In general, Equation 6.45 is that of a conic section and takes the
following forms:

1. An ellipse if $\Delta^2 > 4h^2\alpha^2$;

2. A hyperbola if $\Delta^2 < 4h^2\alpha^2$;

3. A parabola if $\Delta^2 = 4h^2\alpha^2$;

4. A circle if $\alpha = 0$;

5. A pair of straight lines if $\Delta = 0$.

The path difference $\Delta$ is small for very thin films ($t_p$ small), in which
case the fringe forms will be hyperbolic or circular; they will in
fact be circular if the film surfaces are accurately parallel ($\alpha = 0$),
giving the Haidinger-type fringes of equal inclination already described
in Chapter 9 of Volume 1.

The eccentricity of the hyperbola will increase with decreasing
path difference $\Delta$ until the limiting case where $\Delta$ approaches zero,
when the fringes will be straight lines.
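The classification of fringe shapes can be expressed as a short decision rule based on the coefficient of $x^2$ in Equation 6.45. The following is a minimal sketch: the function name, the use of a relative tolerance for the parabolic case, and the sample values of path difference, eye distance and wedge angle are assumptions made for illustration.

```python
import math

def fringe_conic(delta, h, alpha):
    """Classify the fringe locus of Equation 6.45 (hypothetical helper).

    delta: path difference, h: distance of the eye from O,
    alpha: angle between the surfaces in radians (lengths in the same units).
    """
    if delta == 0:
        return "pair of straight lines"
    if alpha == 0:
        return "circle"
    lhs = delta ** 2
    rhs = 4.0 * h ** 2 * alpha ** 2
    if math.isclose(lhs, rhs, rel_tol=1e-12):
        return "parabola"
    return "ellipse" if lhs > rhs else "hyperbola"

# Assumed sample values: eye 0.25 m from the film, wedge angle 1e-4 rad.
print(fringe_conic(1e-6, 0.25, 1e-4))   # very thin film, small path difference
print(fringe_conic(1e-3, 0.25, 1e-4))   # larger path difference
print(fringe_conic(1e-6, 0.25, 0.0))    # accurately parallel surfaces
```

The first case falls in the hyperbolic regime that the text associates with very thin wedge films, and the parallel-surface case gives the circular fringes of equal inclination.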



6.21 Fizeau Fringes 

An imperfectly collimated beam of monochromatic light is incident
upon a thin film at a point P (Fig. 6.28). If $\delta I$ is the angle between
extreme rays in the cone of light, a range of angles of incidence
to the normal exists between I and $I + \delta I$. If the interference
fringe in the reflected light in the region of P is dark, for the pth
order

$$p\lambda = 2nt\cos R \tag{6.46}$$

where $\lambda$ is the wavelength of the light, n is the refractive index of the
material of the film, R is the angle of refraction in the film, and t is
the thickness of the film at point P (see Equation 9.3 of Volume 1).

Fig. 6.28. Incidence of imperfectly collimated light on a thin film (the film
thickness is greatly exaggerated)

But there is not only a single value of the angle of refraction R:
a range of values exists because of the range of angles of incidence
from I to $I + \delta I$. The effect of the range of angles of incidence is to
move the order p at P through a range corresponding to a change of
thickness of the film from t to $t + \delta t$.

For a given order of interference p, $p\lambda$ is a constant. Differentiation
of Equation 6.46 therefore leads to

$$0 = 2n\cos R\,\delta t - 2nt\sin R\,\delta R$$

where $\delta R$ is the range of values of R corresponding to $\delta I$. Thus

$$\delta t = t\tan R\,\delta R \tag{6.47}$$

From Equation 6.47, it is seen that fringes of the thin film type can
be produced when t is large provided that tan R is small (i.e. the
incident beam is along or near the normal) and $\delta R$ is small (i.e. the
incident beam is highly collimated).
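A numerical illustration of Equation 6.47 may help to show why a thick plate can still give clear fringes. The sketch below computes the equivalent thickness spread for assumed sample values (a 5 mm plate of index 1.5, refraction at 0.5 degrees to the normal, an angular spread of $10^{-4}$ rad, wavelength 546 nm). The comparison with the thickness change per order, $\lambda/(2n\cos R)$, is an inference from Equation 6.46 rather than a statement made in the text, and the function names are hypothetical.

```python
import math

def thickness_spread(t, R, delta_R):
    """Equivalent thickness spread from Equation 6.47: delta_t = t tan(R) delta_R."""
    return t * math.tan(R) * delta_R

def thickness_per_order(wavelength, n, R):
    """Thickness change that shifts Equation 6.46 by one order (assumed criterion)."""
    return wavelength / (2.0 * n * math.cos(R))

# Assumed sample values, all lengths in metres and angles in radians.
t = 5e-3
R = math.radians(0.5)
delta_R = 1e-4
spread = thickness_spread(t, R, delta_R)
per_order = thickness_per_order(546e-9, 1.5, R)
print(spread, per_order, spread / per_order)
```

For these values the spread is a few nanometres, a small fraction of the thickness change corresponding to one order, so the fringe is hardly blurred even though the plate is thousands of wavelengths thick.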

Fringes of this type are called Fizeau fringes; they are widely used
in testing optical components. The essential features of the necessary
optical system are two non-silvered surfaces, which
may be widely separated, and a small point source of light at the
focal point of a highly corrected converging lens, which thus gives
a plane wavefront incident nearly normally on one of the surfaces.

An arrangement for observing Fizeau fringes [Fig. 6.29(a)]
comprises a small point source S of monochromatic light at the
focal point of a highly corrected collimator lens L. The collimated
light from L is incident slightly away from the normal upon a thick
glass plate PQ on a black table. Suppose this glass plate has plane
surfaces P and Q inclined to one another at a small angle, forming
a wedge. Two plane waves leave this wedge, one reflected from its
front-surface P and the other from its back-surface Q. The crests
and troughs in these reflected waves before they enter the lens L
are represented by full lines and dashed lines respectively in Fig.
6.29(b). It is seen that along lines such as line 1, the crests and troughs
in one reflected wave coincide with those in the other and so produce
constructive interference; along lines such as line 2, crests coincide
with troughs to give destructive interference. In a plane such as
AB, a fringe system is observed with maxima and minima where
it is intersected by the loci of line 1 and line 2. These loci intersect
the glass plate at thicknesses of $2nt = (p + \tfrac{1}{2})\lambda$ for maxima and
$2nt = p\lambda$ for minima.

Fig. 6.29. Observation of Fizeau fringes

The plane wavefronts in the reflected light become spherical after
traversing the lens L. The eye placed to receive the light emerging
from L is able to discern, in relation to these fringes, departures from
flatness of the surfaces P and Q of the glass plate and lack of parallelism
between P and Q.



6.22 Fraunhofer Diffraction Treated by Fourier Methods 

A powerful theoretical method in dealing with problems of diffrac- 
tion makes use of the Fourier transform. An introduction only to 
the fundamental ideas involved is given here. 

In the most general case, the analytical problem in diffraction 
can be stated in the form that, in the propagation of radiation, the 
amplitude and intensity distribution are known at a given region 
and it is required to find the amplitude and intensity distribution at 
a separate region. For Fraunhofer diffraction, the known distribu- 
tion is where the diffracting obstacle (such as an aperture) is placed 
(a region sometimes called the 'object space', not to be confused 
with the use of this term in relation to lenses), and it is required to 
evaluate the distribution at a large distance in the image or Fraun- 
hofer space where the diffraction pattern of interest is formed. 

As the Fraunhofer space is at a large distance compared with the
dimensions of the diffracting obstacle and the wavelength of the
radiation, the interfering wavetrains that give rise to the diffraction
pattern may be considered as parallel bundles of radiation leaving
volume elements in the object space in a specified direction. In Fig.
6.30, therefore, let P and Q be the centres of two small equal volume
elements $\delta V$ in the object space (i.e. where the specified distribution
is known in relation to a diffracting obstacle at this position).
Suppose that $\delta V$ is small enough to contain a uniform density of
radiating Huygens' sources of secondary wavelets. P and Q are in
positions relative to an origin O decided by the vectors $\mathbf{r}_p$ and $\mathbf{r}_q$
respectively, and $\mathbf{r}$ is the vector joining P and Q. The magnitudes
$r_p$, $r_q$ and r of these vectors are all small, because the diffracting
obstacle dimensions are small compared with the distance to the
Fraunhofer space.

Fig. 6.30. Specification of positions within the object space in the Fourier treatment
of Fraunhofer diffraction

Assume that the specified spatial amplitude distribution at a point
such as P in the object space is given by

$$u = f(r_p, \theta, \phi) \tag{6.48}$$

where $r_p$, $\theta$ and $\phi$ are the polar coordinates of P with respect to the
origin at O, and f is a specified continuous function. The Huygens'
source of secondary wavelets at P will produce at any point R at
a distance r' from P a spatial amplitude distribution given by

$$u' = g'(r', \theta', \phi')$$

where g' is a continuous function, and r', $\theta'$ and $\phi'$ are the polar
coordinates of R with respect to an origin at P. At unit distance
from P, r' = 1 and

$$u' = g'(\theta', \phi')$$

Suppose the point R is in the direction of a unit vector $\mathbf{s}$ emanating
from P. Then the specification of $\mathbf{s}$ corresponds to the specification
of $\theta'$ and $\phi'$, so u' may be written as

$$u' = g(\mathbf{s}) \tag{6.49}$$

where g is a continuous function.

This radiation from P in the direction determined by s will inter- 
fere with the parallel radiation from Q in the direction s. It will be 
this interference (which may in practice only occur via the action of 
a lens focusing the Fraunhofer diffraction pattern at a screen) 
which decides the resultant amplitude in the Fraunhofer space (i.e. 
in the diffraction pattern at, or because of the focusing lens effec- 
tively at, a large distance) due to P and Q combined. However, the 
radiation in the direction of $\mathbf{s}$ from Q differs in path by $r\cos\alpha$
from that produced by P, where $\alpha$ is the angle between $\mathbf{r}$ and $\mathbf{s}$
(Fig. 6.30). The corresponding phase difference will be $kr\cos\alpha$,
where $k = 2\pi/\lambda$, $\lambda$ being the wavelength of the radiation. Therefore,
whereas the amplitude of the electric vector in the radiation due to
P is proportional to $g(\mathbf{s})$ and is perpendicular to $\mathbf{s}$, that due to Q
will also be $g(\mathbf{s})$ but at angle $\alpha$ to $\mathbf{s}$. The complex amplitude
resulting from the addition of the two is hence proportional to
$g(\mathbf{s})\exp(jk\,\mathbf{r}\cdot\mathbf{s})$, where $\mathbf{r}\cdot\mathbf{s}$ is the scalar product of $\mathbf{r}$ and $\mathbf{s}$ of
magnitude $rs\cos\alpha$.

As the specified spatial amplitude distribution at P is given by
Equation 6.48 and there is a uniform density of Huygens' sources in
the small volume element $\delta V$, it follows that the complex amplitude
in the Fraunhofer space due to P and Q is given by

$$\delta U' = a\,f(r_p, \theta, \phi)\,g(\mathbf{s})\exp(jk\,\mathbf{r}\cdot\mathbf{s})\,\delta V$$

where a is a constant of proportionality.

Assuming that there are no discontinuities in the radiation between
the diffracting obstacle and the Fraunhofer space, the resultant
complex amplitude U due to all the volume elements $\delta V$
around all points such as P and Q in the diffracting obstacle is
given by

$$U = g(\mathbf{s})\int_{-\infty}^{\infty} f(r_p, \theta, \phi)\exp(jk\,\mathbf{r}\cdot\mathbf{s})\,dV \tag{6.50}$$

for a given direction $\mathbf{s}$. Note that $g(\mathbf{s})$ is outside the integral sign,
which is justifiable on the basis that it is dependent only on the
fixed direction decided by $\mathbf{s}$.

In the Appendix (page 362) it is shown that the integral

$$\int_{-\infty}^{\infty} f(r_p, \theta, \phi)\exp(jk\,\mathbf{r}\cdot\mathbf{s})\,dV$$

is the Fourier transform of $f(r_p, \theta, \phi)$.

It is helpful to use Cartesian coordinates, in which event $\mathbf{r}\cdot\mathbf{s} =
(x_q - x_p)l + (y_q - y_p)m + (z_q - z_p)n$, where P is the point $(x_p, y_p, z_p)$
with reference to the origin at O, and the direction cosines of $\mathbf{s}$
are l, m and n with respect to a right-handed set of Cartesian
axes at O. The integral then becomes

$$\iiint f(x, y, z)\exp[jk(lx + my + nz)]\,dx\,dy\,dz$$

and U is a function of l, m and n.

This general treatment shows that a very simple relationship exists
between the Fraunhofer object and image spaces in that the amplitude
distributions are Fourier 'mates'. Thus, in the analysis, U in
Equation 6.50 is the Fourier transform of $g(\mathbf{s})f(r_p, \theta, \phi)$ and, similarly,
the whole process may be reversed to consider the distribution
in the object space if that in the image space is specified.



6.23 The Aperture Function: Fraunhofer Diffraction due to 
a Single Slit 

The value of $g(\mathbf{s})f(r_p, \theta, \phi)$ gives the amplitude due to P only at
a unit distance along the direction of vector $\mathbf{s}$. This is known as the
aperture function A. If the Huygens' wavelets from such a point
as P have spherical wavefronts, $g(\mathbf{s})$ is a constant, having the same
magnitude irrespective of the direction of $\mathbf{s}$. The aperture function
A is then given by

$$A = \text{constant} \times f(r_p, \theta, \phi)$$

and Equation 6.50 becomes

$$U = \int A\exp(jk\,\mathbf{r}\cdot\mathbf{s})\,dV$$

which is evaluated from the equivalent result

$$U = \text{(Fourier transform of } A) \tag{6.51}$$

Fig. 6.31. The 'top-hat' distribution corresponding to a plane wave incident upon a
single narrow slit in a screen: (a) plane wave incident on a narrow slit of width a in a
screen, with the x-axis in the object space and the x'-axis in the Fraunhofer space at a
distance much greater than a; (b) top-hat function of width a along the x-axis in the
object space



Consider a plane wave of monochromatic radiation of wavelength
$\lambda$ incident upon a narrow slit in a screen [Fig. 6.31(a)].
Within the slit itself, the spatial distribution of amplitude is a constant
irrespective of position (presuming that the slit is uniformly
illuminated). Hence $f(r_p, \theta, \phi)$ is a constant and, presuming that the
Huygens' secondary wavelets are spherical, the aperture function
is a constant of value $A_0$, say. Outside the slit in the plane of the
screen, the aperture function will be zero. The distribution function
corresponding to the constant-aperture function $A_0$ is called the
'top-hat' function [Fig. 6.31(b)].

In the analysis, it is convenient to use Cartesian coordinates:
one set x, y and z in the object space, and a parallel set x', y' and z'
in the Fraunhofer space. For the narrow slit in the circumstances
specified, the distribution of amplitude as a function of x only is
required. The origin O is taken to be at the centre of the slit, so x is
zero at O. In the Fraunhofer space, x' is taken to be zero at the
point O', where the direct light perpendicular to the plane of the slit
intersects a plane in the Fraunhofer space parallel to the screen
containing the slit [Fig. 6.31(a)]. If the slit width is a, the 'top-hat'
function will have the value $A_0$ over the interval from $-a/2$ to $+a/2$
and will be zero elsewhere. The amplitude distribution in the object
space is therefore given by

$$f(x) = A_0$$

for $-a/2 \le x \le +a/2$ and

$$f(x) = 0$$

for $x > +a/2$ and $x < -a/2$.

The Fourier transform of f(x) (see Appendix, page 362) is given
by

$$f(x') = \int f(x)\exp(-2\pi jxx')\,dx$$

which, with $f(x) = A_0$ over the limits specified, becomes

$$f(x') = \int_{-a/2}^{a/2} A_0\exp(-2\pi jxx')\,dx$$

Therefore

$$f(x') = \frac{A_0}{-2\pi jx'}\left[\exp(-2\pi jx'a/2) - \exp(2\pi jx'a/2)\right]$$

Putting $\alpha = \pi ax'$, then from Equation 6.51 the expression

$$U'_x = f(x') = \frac{A_0a}{-2j\alpha}(-2j\sin\alpha) = A_0a\,\frac{\sin\alpha}{\alpha}$$

gives the amplitude $U'_x$ at a point x' in the Fraunhofer space.



At x' = 0 in the Fraunhofer space,

$$U'_x = U_0 = f(0) = A_0a\,\frac{\sin\alpha}{\alpha} = A_0a$$

because

$$\lim_{\alpha \to 0}\frac{\sin\alpha}{\alpha} = 1$$

Hence

$$\frac{U'_x}{U_0} = \frac{\sin\alpha}{\alpha} \tag{6.52}$$

and

$$\frac{I_x}{I_0} = \frac{\sin^2\alpha}{\alpha^2}$$

where $I_x$ and $I_0$ are the intensities corresponding to the amplitudes
$U'_x$ and $U_0$ respectively.

This result is the same as that obtained by a different analytical
method in Section 6.11, leading to Equation 6.24. Note that
$\alpha = \pi ax'$ in Equation 6.52 is the same as the $\alpha$ used in Section 6.11
because $x' = (\sin\theta)/\lambda$.

Fig. 6.32. Amplitude pattern in the Fraunhofer space when a plane wave is incident
upon a narrow slit

The distribution of amplitude in the Fraunhofer space due to a
single slit as given by Equation 6.52 is shown in Fig. 6.32. It is
interesting to note that conversely the Fraunhofer pattern of such
a distribution will be its Fourier transform, which will be the 'top-hat'
function of Fig. 6.31.
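The single-slit result can be checked numerically by evaluating the Fourier integral of the top-hat function directly and comparing it with $A_0a\,(\sin\alpha)/\alpha$. The sketch below uses a simple midpoint rule; the function names, the number of integration steps and the sample values ($a = 1$, $A_0 = 1$, in arbitrary units) are assumptions made for illustration.

```python
import cmath
import math

def top_hat_transform_numeric(a, A0, x_prime, steps=2000):
    """Midpoint-rule estimate of the integral of A0 exp(-2 pi j x x') over the slit."""
    dx = a / steps
    total = 0j
    for i in range(steps):
        x = -a / 2.0 + (i + 0.5) * dx
        total += A0 * cmath.exp(-2j * math.pi * x * x_prime) * dx
    return total

def top_hat_transform_analytic(a, A0, x_prime):
    """Closed form A0 a sin(alpha)/alpha with alpha = pi a x'."""
    alpha = math.pi * a * x_prime
    if alpha == 0:
        return A0 * a
    return A0 * a * math.sin(alpha) / alpha

a, A0 = 1.0, 1.0                       # assumed slit width and amplitude
for x_prime in (0.0, 0.5, 1.0, 1.5):   # x' = 1/a is the first zero
    numeric = top_hat_transform_numeric(a, A0, x_prime)
    analytic = top_hat_transform_analytic(a, A0, x_prime)
    print(x_prime, numeric.real, analytic)
```

The imaginary part of the numerical integral vanishes by symmetry, and the real part follows the closed form, including the first zero at $x' = 1/a$ and the negative lobe beyond it.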




The Fourier approach to Fraunhofer diffraction described here 
in the case of a single narrow slit is only indicative of a powerful 
analytical method of particular value in problems involving dif- 
fraction and interference effects of a complexity beyond the stan- 
dard of this text. 



Exercise 6 

1. Distinguish between Fraunhofer and Fresnel diffraction. Explain the action 
of a zone plate and show that the object and image distances obey the ordi- 
nary thin lens formula. 

A plane parallel beam of microwave radiation of wavelength 3 cm falls 
normally on a circular aperture in a conducting screen. What radii can the 
aperture have in order to produce a focus at an axial distance of 10 metres? 

(L.P.) 

2. Explain what is meant by (a) Fresnel half-period zones, (b) a zone plate. 
Describe one simple experiment that can be explained in terms of half-period 
zones. 

A small source emitting light of wavelength 5000 Å is placed on the axis in
front of a zone plate at a distance of three metres from the plate. If the 
diameter of the first zone is 2 mm describe with the necessary calculations 
what may be observed at points on the axis behind the plate. (L.G.) 

3. Give a qualitative description of the application of the Cornu spiral to the 
diffraction of a monochromatic cylindrical wave front by a screen having 
a straight edge parallel to its axis. Discuss the variation in intensity across 
the line of the geometrical shadow from uniformly dark to uniformly bright 
regions. (L.P.) 

4. Explain what is meant by a Fresnel half-period zone and use the concept of 
half-period zones to explain the variations in intensity occurring along the 
axis of a circular aperture placed tangentially to the wavefront emitted from 
a point source of monochromatic radiation. 

A point source of monochromatic radiation of wavelength 5460 Å is situated
50 cm from a circular aperture 1 mm in diameter, the aperture being tangen- 
tial to the wavefront. Considering points on the axis calculate the greatest 
distance from the aperture at which a maximum of intensity occurs. Deter- 
mine also the positions of the next two maxima. (City) 

5. Show that, if a cylindrical wavefront is considered as consisting of a number 
of narrow rectangular zones (considerably smaller than half-period zones) 
with uniform phase difference from one to the next, a phase diagram may be 
constructed from which the amplitude at a point due to any part of the 
wavefront may be determined. 

A source of wavelength 5460 Å producing a cylindrical wavefront is set up
50 cm from an adjustable slit whose length is parallel to the axis of the wavefront
and whose plane is tangential to it. A screen is placed 100 cm from the
slit, the planes of the screen and slit being parallel. Use the Cornu spiral
supplied to predict the general shape of the intensity diffraction pattern on
the screen when the slit is (a) $6 \times 10^{-3}$ cm, (b) $7.8 \times 10^{-2}$ cm, (c) 0.115 cm
wide. (Detailed drawings and calculations not required.) (City)




6. A reflecting astronomical telescope focusses light from a star on to a photo- 
graphic plate using a concave mirror of diameter 120 cm and focal length 
700 cm. Describe and explain the photographs which may be obtained when 
the front of the tube is covered with (a) no screen, (b) an opaque cover with 
two circular apertures 12 cm in diameter and 100 cm apart, (c) a transmission 
grating with a grating space of 1 mm, (d) a square-mesh wire net with wires 
1 mm thick spaced apart by 2 cm and (e) a mosaic screen complementary to 
the wire mesh in (d). (L.P.) 

7. Show that the Fraunhofer diffraction pattern of a simple one-dimensional 
aperture is the Fourier transform of the aperture function. 

Describe what happens to a Fraunhofer single-slit diffraction pattern of 
monochromatic light if the whole apparatus is immersed in water. 
A circular piston 60 cm in diameter oscillates at a frequency of 25,000 c/s 
as an underwater source of sound for submarine detection. Far from the 
source the sound intensity is distributed as a Fraunhofer diffraction pattern 
for a circular hole whose diameter equals that of the piston. Take the speed 
of the sound in water to be 1,450 m/s and find the angle between the normal 
to the piston and the direction of the first minimum. (M.P.) 



CHAPTER 7 



Colour and Colour 
Measurement 



7.1 Introductory Concepts 

The reproduction of colour by physical and chemical processes in 
such technologies as those of colour photography, colour tele- 
vision, printing, fabrics, paper, plastics and ceramics has led to a 
twentieth-century world in which colour science has to a large 
extent taken over in a field in which formerly the artist was supreme. 
Apart from the fact that this change has probably been a main 
reason for the peculiarities of much of modern art, it has led to a 
more colourful environment and the necessity for measuring colour 
in a scientific manner which is, as far as possible, independent of 
the observer. 

Independence of the observer is, however, not strictly possible, 
because the treatment of colour is essentially psychological or, 
better, psychophysical. In the first place, colour is associated with 
visible radiation having a given spectral distribution. But the human 
eye does not operate like the spectrometer: it is incapable of anal- 
ysis of the component wavelengths in the spectrum; its response 
varies through the spectrum. The human eye is only capable of 
discerning three types of sensation : hue, brightness and saturation. 
Moreover, whereas radiation with a given spectral distribution 
produces a certain sensation in a normal observer, this same 
sensation — meaning that the observer sees the same colour — can 
be produced by a variety of different spectral distributions. 

The term hue is, scientifically speaking, regarded as having a 
different meaning from the term colour in that it represents that 
aspect of colour which changes depending on the part of the 
spectrum involved; hue is therefore concerned with the dominant 
wavelength present in the radiation. 

In relation to sources of light, a main characteristic is luminous 
flux which evokes the sensation of brightness. Brightness is a 
measure of the total sensation ; for a luminous source the more 
specific term is luminous emittance; for a body which gives out 
light by reflection and scattering when illuminated by a luminous 
source, the specific term is luminance (Section 6.4 of Volume 1). 

Saturation is a measure of the purity of the colour. A saturated 
colour is pure: the monochromatic spectral radiation over a small 
wavelength interval in the visible spectrum is pure. Saturation 
decreases as the colour is mixed with white — the colour becomes 
desaturated. 

The term chromaticity is used to denote both dominant wave- 
length (hue) and purity. 



7.2 Additive Colour Mixing 

One of the most fascinating results of scientific experiment is that, 
if three light sources of different hues are available, a whole range 
of colours can be produced by mixing these three different hues in 
various amounts. The ability to match a wide range of 
chromaticities (hue and saturation) depends on the three hues 
chosen. By a judicious choice of the three light sources, 
almost all chromaticities can be simulated. The usual choice of 
three primaries in additive colour mixing is red, green and blue. 

This ability to match almost all colours by an appropriate addi- 
tive mixture of three primaries is an established fact based on 
experiment. The explanation depends on the way in which the 
human eye works in discerning colour; this demands a theory of 
colour vision. 

To explain three-colour theory, Young and also Helmholtz 
supposed that there are three sets of nerves in the retina of the eye, 
one for each primary colour. White light stimulates all three sets, 
yellow light stimulates the green and red nerves together, blue- 
green light stimulates the blue and green receptors, and similarly 
for other colours. These colour receptors must be associated with 
the cones in the retina and not the rods, which lack colour sensitiv- 
ity (Section 3.2 of Volume 1). 

There are objections to this theory. Anatomical examination of 
the eye does not reveal three sets of nerves to each sensory part of 
the retina. Again, this theory implies that red light will excite only 
the 'red' nerves, yellow will excite the red and green nerves, and 
white light will excite all three sensations. This was shown to be 
incorrect by Koenig, who experimented on the observation of 
monochromatic radiations through the spectrum. Another difficulty 
is that the sensation of white light can be produced by two comple- 
mentary colours. Thus if the red portion is removed from the 
continuous white light spectrum, the remaining colour is blue- 
green, which is complementary to red. A colour-blind observer 
incapable of seeing red may be considered simply to have the 'red' 
nerve sensation absent in his retina. To him, the combination of 
red and blue-green as complementary colours should not produce 
the sensation of white. But this is not correct. 

Fig. 7.1. Additive colour mixing 

Subsequent theories of colour vision have been put forward 
by Hering and, in recent years, much work has been done by 
W. D. Wright and others. The theories are useful, but not yet 
believed implicitly. A satisfactory explanation has yet to be evolved. 

Despite the lack of a full explanation of three-colour additive 
mixing, the fact that it is possible remains, and it is widely used in 
the specification and measurement of colour. Additive colour 
mixing may be demonstrated simply by the use of three projection 
lanterns, one with a red filter over its projection lens, the second 
with a green filter and the third with a blue. Other colours (provided 
they are different from one another) can be used, but red, green 
and blue are a good choice. 

Suppose each of these lanterns projects a separate circular patch of 
light on to a white screen. One patch R will be red, the second G 
will be green and the third B blue. Let each projector lamp be 
provided with a variac transformer control of the voltage, so that 
the amounts of red, green and blue light can be separately varied. 
The projectors are then set so that the circular patches of light 
overlap completely. By varying the amounts of the three colours, 
a very wide range of different colours can be produced, and an 
accurate match can be made to a separate patch of light X from 
a fourth projector furnished with a filter different from those 
concerned with patches R, G and B (Fig. 7.1). 

This demonstration forms a crude additive colorimeter. The 
mixture is additive, because the amounts of reflected (or scattered) 
light in the three components R, G and B are superimposed upon 
one another in the total light reaching the eye. The screen has to 
be white simply because white material is able to scatter light of 
any colour. 

The eye is not able to detect the primary components in a 
mixture. Two or more light vibrations of different frequencies 
when mixed give the eye the sensation of a single colour. 
This contrasts with the human sense of hearing, in which two sounds 
of different frequencies can be distinguished individually at the 
same time. 



7.3 Tristimulus Additive Colour Matching 

To render the simple experiment described in Section 7.2 more 
precise and so form a basis of quantitative colorimetry, the gen- 
eration of pure spectral colours is preferable to the use of filters. 
Three primaries in the form of monochromatic radiations at 
wavelengths 6,500 Å (red), 5,300 Å (green) and 4,250 Å (violet) 
produced by a prism or grating monochromator may be used. 
The amount of each primary hue required to generate a particular 
colour is known as the tristimulus value. 

In practice it is found that, however judicious the choice of the 
three primaries, it is not possible to generate all possible chromat- 
icities, though a very wide range can be produced by the three 
primary monochromatic radiations specified. For example, it is 
not possible to match saturated blue-green by such three-colour 
addition. 

There is, however, a solution to this problem. If the light X to be 
matched by the addition of the given primaries R, G and B (or V, 
for violet) is outside the range producible by these primaries, a 
match can be obtained if the correct amount of light from one of 
the primaries (R, G or B) is added to X itself. This 
corresponds to subtracting that light from the match: the tristimulus 
value of the subtracted primary is then negative. 
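
The origin of the negative sign can be seen by writing the match as a
statement and rearranging it. This is a sketch only: the lower-case amounts
r, g and b, and the use of the sign ≡ to mean 'matches', are notation assumed
here for illustration and are not taken from the text. If an amount r of the
red primary has to be added to X before a match is possible, then

```latex
X + r\,R \equiv g\,G + b\,B
\quad\Longrightarrow\quad
X \equiv -\,r\,R + g\,G + b\,B ,
\qquad r,\ g,\ b > 0 .
```

The tristimulus values of X are then (−r, g, b). The radiation of wavelength
5,000 Å quoted in connection with Fig. 7.2, which needs about −20 lm of the
6,500 Å primary, is a case of this kind.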

The selection of the particular red, green and blue primaries in 
additive mixing is not the only choice but the one which allows a 
wide range of colours to be matched with a minimal use of negative 
primaries. 

An interesting illustration of colorimetry using the primary 
monochromatic radiations at 6,500 Å, 5,300 Å and 4,250 Å is the 
determination of the tristimulus values needed to produce pure 
spectral hues in various parts of the visible spectrum. In the three 
graphs of Fig. 7.2 are shown the number of lumens in the luminous 
flux at the wavelengths 6,500 Å, 5,300 Å and 4,250 Å required to 
match 1 W of radiant flux at the various wavelengths between 
4,000 Å and 7,000 Å marked out on the abscissae. For example, 
the tristimulus values to obtain a match to 1 W of monochromatic 
radiation at a wavelength of 5,000 Å are, approximately, −20 lm 
at primary 6,500 Å [Fig. 7.2(a)], 264 lm at primary 5,300 Å [Fig. 
7.2(b)] and 12 lm at 4,250 Å [Fig. 7.2(c)]. 

Fig. 7.2. Matching pure spectral lines by the use of primaries: (a) at 6,500 Å; 
(b) at 5,300 Å; (c) at 4,250 Å. Tristimulus values in lumens to match 1 W 
of monochromatic radiation of wavelength λ (abscissae: wavelength λ in Å, 
from 4,000 to 7,000) 




7.4 The C.I.E. System in Colour Measurement 

The use of actual primary colours necessitates the use of negative 
tristimulus values in obtaining matches to some of the chromatic- 
ities encountered in the very wide range possible. This is indicated 
by the negative values in the graphs of Fig. 7.2, which represent 
an excellent choice of primaries. 

This use of negative values is undesirable in any mathematical 
formulation of colour matches. It can be avoided if three specified 
component primary lights are chosen having certain tristimulus 
values at various wavelengths. This can be done theoretically, 
though not practically. Thus imaginary primaries may be chosen 
for convenience, even though no conceivable sources of light are 
available for producing them. 

The International Commission on Illumination (C.I.E., from the 
French 'Commission Internationale de l'Eclairage') agreed in 1931 
to express all colour mixture data in terms of three specified compo- 
nent lights. 

In making their recommendation, the following principles were 
observed: 

1. To eliminate the effect of colour vision variation amongst 
human beings (which is significant even for the majority of 
people with 'normal' three-colour vision) and to avoid the 
inevitable subjective nature of colour matching, a standard 
observer was arrived at by averaging the responses of a large 
number of observers. 

2. The three standard primary components chosen were such 
that the tristimulus values of all real colours are positive. In 
avoiding negative values it was necessary to choose reference 
stimuli, which are not real but 'imaginary' colours. The plots 
of these tristimulus values against wavelength for these 
reference stimuli (Fig. 7.3) are curves x, y and z, nominally 
red, green and blue. 

3. The curve y (green stimulus curve) was plotted so as to be 
similar to the relative luminous efficiency against wavelength 
curve for the human eye (Fig. 6.5 of Volume 1). The maximum 
tristimulus value at wavelength 5,550 Å was therefore chosen 
to be unity. This is convenient because it ensures that the 
luminance of the green (curve y) primary is the same as that 
of the mixture, provided that it is also arranged for the lumi- 
nance of the red (curve x) and blue (curve z) primaries to be 
zero. 

4. The areas under each of the curves x, y and z are made equal. 
This necessitates a choice of the tristimulus values for curves 
x and z to agree with those selected in accordance with the 
criteria given for curve y in the paragraph above. As the 
tristimulus values depend on the response of the eye at a given 
wavelength times the energy radiated per second (radiant 
energy emitted by the source per unit time, specified in watts), 
the curves x, y and z need to be equal-energy curves; this 
means that the radiant flux is the same in each and every 
wavelength interval. 

Fig. 7.3. The C.I.E. standards for colour mixing: curves x (red), y (green) 
and z (blue) plotted against wavelength λ (Å) 

The symbols x̄, ȳ and z̄ are used for the ordinates of the curves 
x, y and z respectively. The values of x̄, ȳ and z̄ at any particular 
wavelength are thus the tristimulus values of a spectral colour of 
that wavelength. For example, the tristimulus values of light of 
wavelength 5,000 Å are x̄ = 0.0049, ȳ = 0.3230 and z̄ = 0.2720 
(these values can be seen approximately in Fig. 7.3). 

Consider the C.I.E. specification of a source of light which, in 
general, will not be monochromatic. This source will have a certain 
spectral distribution: a radiant flux in a small wavelength interval 
λ to λ + dλ of f(λ) dλ, where there is a spectrophotometrically 
determined graph of f(λ) against λ. Let the tristimulus values 
required for the standard C.I.E. primaries x, y, z to match the 
spectral distribution of this source be X, Y and Z respectively. 

As the ordinates of the curves on the tristimulus graph represent 
the amount of the standard C.I.E. components required to match 
unit radiant flux at each wavelength: 

The total amount of red (x) standard light flux is 

$$X = \int_0^\infty \bar{x}\, f(\lambda)\, d\lambda ;$$ 

the total amount of green (y) standard light flux is 

$$Y = \int_0^\infty \bar{y}\, f(\lambda)\, d\lambda ;$$ 

the total amount of blue (z) standard light flux is 

$$Z = \int_0^\infty \bar{z}\, f(\lambda)\, d\lambda .$$ 

The integration needed is undertaken usually for about 30 values 
of f(λ). Graphical, numerical or mechanical methods are necessary, 
as f(λ) against λ and the curves of Fig. 7.3 cannot be expressed as 
mathematical functions. 
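
A numerical version of this integration might be sketched as follows, using
the trapezoidal rule on values tabulated at common wavelengths. The function
names and the five-point table are hypothetical and serve only to show the
arithmetic; the table is not C.I.E. data, though it has been chosen so that
the three curves enclose equal areas.

```python
def trapezoid(values, wavelengths):
    """Trapezoidal-rule integral of sampled values over wavelength."""
    total = 0.0
    for i in range(1, len(wavelengths)):
        width = wavelengths[i] - wavelengths[i - 1]
        total += 0.5 * (values[i] + values[i - 1]) * width
    return total


def tristimulus(wavelengths, flux, xbar, ybar, zbar):
    """Return (X, Y, Z) for a source with spectral flux f(lambda).

    All five sequences must be sampled at the same wavelengths.
    """
    X = trapezoid([x * f for x, f in zip(xbar, flux)], wavelengths)
    Y = trapezoid([y * f for y, f in zip(ybar, flux)], wavelengths)
    Z = trapezoid([z * f for z, f in zip(zbar, flux)], wavelengths)
    return X, Y, Z


# Hypothetical five-point table, for illustration only (not C.I.E. data).
wavelengths = [4000.0, 5000.0, 6000.0, 7000.0, 8000.0]   # angstroms
xbar = [0.0, 0.2, 1.0, 0.2, 0.0]
ybar = [0.0, 0.6, 0.8, 0.0, 0.0]
zbar = [0.8, 1.0, 0.0, 0.0, 0.0]
flux = [1.0, 1.0, 1.0, 1.0, 1.0]   # equal-energy source

X, Y, Z = tristimulus(wavelengths, flux, xbar, ybar, zbar)
print(X, Y, Z)
```

Because the three illustrative curves have equal areas, the equal-energy
source gives X = Y = Z. In practice about 30 tabulated values would be used,
as stated above.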

Having found the X, Y and Z tristimulus values for the light 
source in question, the chromaticity values x, y and z of the same 
source may be calculated. These values are known as the trichro- 
matic coefficients. They provide a method of specifying colour 
independently of intensity because they are defined as : 



$$x = \frac{X}{X+Y+Z}, \qquad y = \frac{Y}{X+Y+Z}, \qquad z = \frac{Z}{X+Y+Z}$$ 


These values could be represented on a three-dimensional diagram. 
However, x + y + z = (X+Y+Z)/(X+Y+Z) = 1, so two quantities 
are sufficient to define a colour. Hence, any colour may be repre- 
sented on a two-dimensional coordinate system. If the same com- 
putation is undertaken for monochromatic sources, a curve called 
a spectrum locus on the chromaticity diagram is obtained (Fig. 7.4). 
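
The calculation of the trichromatic coefficients may be sketched in a few
lines. The function name is hypothetical. The tristimulus values used are
those for light of wavelength 5,000 Å, the value for the y curve being taken
as 0.3230, which is the figure consistent with the coefficients quoted for
this wavelength on the spectrum locus.

```python
def trichromatic_coefficients(X, Y, Z):
    """Return (x, y, z) = (X, Y, Z) / (X + Y + Z)."""
    total = X + Y + Z
    if total == 0:
        raise ValueError("X + Y + Z must be non-zero")
    return X / total, Y / total, Z / total


# Tristimulus values for 5,000 angstrom light
# (the value for the y curve is taken as 0.3230).
x, y, z = trichromatic_coefficients(0.0049, 0.3230, 0.2720)
print(round(x, 4), round(y, 4), round(z, 4))
```

Rounded to four places the result is x = 0.0082, y = 0.5384 and z = 0.4534,
and the three coefficients sum to unity, so any two of them fix the third.
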
Reference is often made to 'white' light. This needs precise 
definition, as a wide range of sources from a tungsten filament 
lamp to the Sun are roughly said to produce white light. White is 
conveniently defined as the colour of a source emitting a continuous 
equal-energy spectrum throughout the visible region: i.e. throughout the 
visible spectrum, the radiant flux (radiant energy emitted by the 
source in unit time, specified in watts) is the same in each and 
every wavelength interval. The radiation from the Sun forms a 
reasonable approximation to an equal-energy spectrum. 



Fig. 7.4. The spectrum locus on a chromaticity diagram 



If equal radiant fluxes of the primary red, green and blue lights 
are additively mixed, white light is generated. In the chromaticity 
diagram (Fig. 7.4), the values of x, y and z must therefore all be 
equal to 0.33, so white is represented on the diagram by point E, 
where x = y = 0.33. 

The trichromatic coefficients for any colour other than mono- 
chromatic light lie within the spectrum locus on the chromaticity 
diagram. Thus, whereas monochromatic light of wavelength 
5,000 Å has trichromatic coefficients of x = 0.0082, y = 0.5384 
and z = 1 − x − y = 0.4534 and is on the spectrum locus, point E 
for white light (almost the same as sunlight) and point C for 
average daylight are inside the locus. 

Let any two colours be represented by points A and B on the 
chromaticity diagram. Suppose these colours are mixed additively. 
The point representing the chromaticity of the mixture will then 
lie at Q on the straight line connecting A and B. The position of 
Q on this line will depend on the sum of the tristimulus values (not 
the coefficients) for each of the components A and B, which is 
decided by their colours and their relative amounts. If this sum is 
I_A for A and I_B for B, the point Q will be at a distance p from A and 
q from B, where p I_A = q I_B. 
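
This rule is that of a centre of gravity, and follows because tristimulus
values add when lights are mixed. A minimal sketch is given below; the
function name, the two colours A and B and the sums 30 and 10 are hypothetical
values chosen only to illustrate the arithmetic.

```python
import math


def mix_chromaticity(point_a, sum_a, point_b, sum_b):
    """Chromaticity (x, y) of an additive mixture of two colours.

    point_a, point_b: (x, y) coordinates of A and B.
    sum_a, sum_b: X + Y + Z for the amounts of A and B present.
    """
    total = sum_a + sum_b
    if total <= 0:
        raise ValueError("the combined amount of light must be positive")
    x = (sum_a * point_a[0] + sum_b * point_b[0]) / total
    y = (sum_a * point_a[1] + sum_b * point_b[1]) / total
    return x, y


# Hypothetical colours A and B, chosen only to illustrate the rule.
A, B = (0.2, 0.6), (0.6, 0.2)
Q = mix_chromaticity(A, 30.0, B, 10.0)

p = math.dist(A, Q)   # distance of Q from A
q = math.dist(Q, B)   # distance of Q from B
print(Q, p * 30.0, q * 10.0)
```

The two products printed are equal, as the rule requires, and Q lies nearer
to A, the component present in the larger amount.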




The diagram can be used to specify colour in a manner more 
easily visualised than from the three trichromatic coefficients. All 
colours obtained by a mixture of white light E and the monochro- 
matic light P are on the line PE. The colour D, for example, can 
be expressed in terms of purity and dominant wavelength. The 
dominant wavelength is decided by such a point as P on the spec- 
trum locus; the degree of saturation with white light by the ratio 
of the distances PD/DE. 
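
The ratio of distances may be evaluated as in the sketch below. The function
name and the position of D are hypothetical; E is the white point and P the
point on the spectrum locus for 5,000 Å, both as quoted in this section.

```python
import math


def saturation_ratio(p, d, e):
    """Ratio of distances PD/DE on the chromaticity diagram."""
    return math.dist(p, d) / math.dist(d, e)


E = (0.33, 0.33)          # white point
P = (0.0082, 0.5384)      # spectrum locus at 5,000 angstroms
# Hypothetical colour D, one quarter of the way from P towards E.
D = (P[0] + 0.25 * (E[0] - P[0]), P[1] + 0.25 * (E[1] - P[1]))

ratio = saturation_ratio(P, D, E)
print(round(ratio, 3))
```

By the mixing rule of the preceding paragraph, the ratio PD/DE equals the
amount of white divided by the amount of spectral light in the mixture, each
measured by the sum of its tristimulus values. A small ratio therefore
denotes a nearly saturated colour.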

This method of specification cannot be applied however to any 
point such as F, which is within the triangle EGH on the chromat- 
icity diagram (Fig. 7.4). This is because a straight line from E 
through F does not intersect the spectrum locus, but the line GH. 
The interpretation of this is that all colours represented by points 
within the triangle EGH are not desaturated spectral colours: they 
are non-spectral colours, the purples or magentas. To find the 
dominant wavelength in a purple such as that represented by point 
F, FE is produced to cut the spectrum locus at J, near 5,400 Å. 
Purples are mixtures of red and blue or violet, and may be described 
as 'minus-greens'. For example, the point F corresponds to white 
light (represented by E) minus light represented by point J, in the 
green at 5,400 Å. 



7.5 Reflection and Scatter of Light 

So far the discussion of colour has been concerned with sources 
of light. Greater interest in practice is attached to the colour of 
surfaces illuminated by a source of light. This colour will depend 
on the nature of the illuminating source, as everyone will know 
who has seen a friend by tungsten lamp and then by sodium street 
lamp lighting. 

It is clearly essential to standardise the luminous source by which 
the colour of a surface is perceived. The C.I.E. recommend three 
standards. These are provided by incandescent tungsten filament 
lamps furnished with a suitable colour filter, preferably in the form 
of a specified absorbing solution. The standards give a close ap- 
proximation to the distribution of energy in the spectrum from a 
black-body. Referring to Fig. 7.4, they are A at 2,848°K (the 
incandescent tungsten lamp used for general illumination purposes 
is thereby simulated), B at 4,800°K and C at 6,500°K (an approxi- 
mate simulation of average daylight). For B and C, the reddish 
component of tungsten lamp radiation has to be absorbed by the 
use of blue filters, which absorb the complementary red. 
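
The shift from a reddish to a bluish distribution as the black-body
temperature rises can be illustrated with Planck's radiation law. The sketch
below is not part of the C.I.E. specification: the function names and the two
comparison wavelengths (4,500 Å and 6,500 Å) are assumptions made for
illustration, and only the three temperatures are taken from the text.

```python
import math

H = 6.62607015e-34   # Planck constant, J s
C = 2.99792458e8     # speed of light, m/s
K = 1.380649e-23     # Boltzmann constant, J/K


def planck(wavelength_angstrom, temperature_k):
    """Black-body spectral radiance per unit wavelength (W sr^-1 m^-3)."""
    lam = wavelength_angstrom * 1e-10          # angstroms to metres
    exponent = H * C / (lam * K * temperature_k)
    return (2.0 * H * C ** 2 / lam ** 5) / math.expm1(exponent)


def blue_to_red_ratio(temperature_k, blue=4500.0, red=6500.0):
    """Radiance at a blue wavelength relative to that at a red one."""
    return planck(blue, temperature_k) / planck(red, temperature_k)


for label, temperature in (("A", 2848.0), ("B", 4800.0), ("C", 6500.0)):
    print(label, temperature, round(blue_to_red_ratio(temperature), 2))
```

The ratio rises steadily from A to C: the tungsten lamp at 2,848 °K is
relatively weak in the blue, which is why blue filters are needed to
simulate the two higher temperatures.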

Measurement of surface colour with a standard source of illu- 
mination is then undertaken and specified in the same way as for 
actual luminous sources. 



7.6 Subtractive Colour Mixing 



In additive colour mixing, there is no light initially, and then 
selected amounts of the primary colours red, green and blue are 
added; in subtractive colour mixing, light of all wavelengths 
is usually present at the beginning (the frequent case being white 
light), and chosen amounts of red, green and blue are subtracted 
from the white. 

If a red filter is placed over a sheet of illuminated white paper, 
the red light enters the eye because the filter has subtracted from 
the white the complementary colour to red: thus it has absorbed 
the blue-green. 

Fig. 7.5. Spectral transmittance curves: curve 1 is for blue filter, curve 2 is for 
yellow filter, and curve 3 is for blue plus yellow filters (abscissae: 
wavelength λ in Å) 

Corresponding to the additive primaries red, green and blue, 
the subtractive primary colours will be the complementary ones to 
these three: minus-red, minus-green and minus-blue, which are blue- 
green (cyan), magenta and yellow respectively. 

Additive mixing is concerned essentially with the addition of 
colour lights; subtractive mixing involves the use of absorbing 
filters, dye-stuffs and pigments. 

An experiment in subtractive colour mixing comparable with 
that described on additive mixing in Section 7.2 utilises only one 
projection lantern instead of three. Suppose a blue filter B is placed 
over the projector lens. The colour of the light transmitted will 
be blue and will have a spectral distribution decided by transmit- 
tance against wavelength (curve 1 of Fig. 7.5). This colour is a 
consequence of the components of the 'white' light from the pro- 
jector which have been absorbed by the filter: the dye-stuff in the 
gelatine filter subtracts the complementary colour minus-blue 
(or yellow). If a second filter Y which transmits yellow light with a 
distribution represented by curve 2 of Fig. 7.5 is placed over the 
first, so that the transmitted light now passes through both B and Y, 
the spectral distribution in this transmitted light will be decided by 
multiplying together the ordinates of the two transmittance curves 
at each wavelength. 
The result is a spectral transmittance shown by curve 3 of Fig. 7.5; 
the transmitted light will now be green. 
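
The multiplication of ordinates may be sketched as follows. The function name
is hypothetical, and the transmittances are invented for illustration; they
are not read from Fig. 7.5.

```python
def combined_transmittance(curve_1, curve_2):
    """Multiply two transmittance curves sampled at the same wavelengths.

    Transmittances are fractions between 0 and 1.
    """
    if len(curve_1) != len(curve_2):
        raise ValueError("curves must be sampled at the same wavelengths")
    return [t1 * t2 for t1, t2 in zip(curve_1, curve_2)]


# Hypothetical transmittances, for illustration only (not read from Fig. 7.5).
wavelengths = [4000, 4500, 5000, 5500, 6000, 6500, 7000]   # angstroms
blue_filter = [0.60, 0.80, 0.50, 0.30, 0.05, 0.00, 0.00]
yellow_filter = [0.00, 0.05, 0.30, 0.80, 0.90, 0.90, 0.90]

both = combined_transmittance(blue_filter, yellow_filter)
peak = wavelengths[both.index(max(both))]
print(both)
print("peak transmission near", peak, "angstroms")
```

With these figures the only appreciable transmission of the pair lies in the
middle of the spectrum, where the two curves overlap, so the light passed is
green. The combined transmittance is also lower at every wavelength than that
of either filter alone.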

If the yellow filter Y transmitted light truly complementary to 
that transmitted by the blue filter B, the result of the two together 
would be no passage of light at all. It is, however, not possible to 
obtain dye-stuffs which are truly complementary in this way. 

Colour reproduction and colour photographic processes are now all 
subtractive; additive colour photographic processes (for 
example, Dufaycolor) have been used but are obsolete. Colour 
printing processes utilise three coloured inks in which the colours 
are printed on top of one another in succession. Colour transpar- 
encies familiar in photography are three-colour with colour- 
sensitised emulsions placed on top of one another. 

In a colour printing process on white paper, the clean paper 
initially reflects white light back to the eye. As each of the primary 
colours is printed in turn, each subtracts from the white light 
depending on the absorption spectrum of the particular ink used 
and to an extent depending on the concentration of colouring 
matter in the ink. The transmitted colour to the eye is the comple- 
mentary one for the first printed colour, and is then decided by the 
products of the transmittances as the second and third colours 
are printed on top. 

In practice, colouring materials and dyes used in colour printing 
and in photography cannot be produced with absorptions of satis- 
factory distribution in the spectrum. Ideally, sharp cut-off curves, 
such as that shown in Fig. 7.6(a) for a filter which transmits red, 
are needed; however, in practice, a curve such as Fig. 7.6(b) is 
obtainable. Consequently, colour printing methods are unable to 
reproduce good 'blacks', so four-colour printing methods are 
often used, the fourth ink being black. Likewise, three-colour 
photographic processes, though they give aesthetically pleasing 
results, do not reproduce accurately the colours in nature. Indeed, 
though much more versatile and convenient, the present-day 
subtractive methods are not so accurate in this respect as the older 
additive procedures at their best; the subtractive process in general 
is not capable of as good colour mixing as the additive ones, be- 
cause the absorption bands of various pigments overlap to some 
extent. 

Fig. 7.6. Spectral transmittance curves for a red dye with minus-red absorption: 
(a) ideal curve; (b) actual curve 

Colour reproductions of paintings vary greatly in quality. The 
cheaper ones utilise four-colour printing: the subtractive primaries 
plus black. More expensive methods utilise a greater number of 
colours in the attempt to achieve verisimilitude. 



7.7 The Measurement of Colour: Colorimeters 

A number of instruments termed additive trichromatic colorimeters 
function, as their name implies, by varying the amounts of three 
component colours necessary to produce a colour match. The Ives 
instrument uses filters to obtain the three components, and the 
amounts of each are varied by the use of diaphragms of variable 
width. 

Wright described, in 1929, a colorimeter for matching any 
spectral colour against three primary hues at wavelengths of 6,500 
Å (red), 5,300 Å (green) and 4,600 Å (blue). The required three 
beams of light of the primary hues were produced at selected 
positions in a continuous spectrum. This spectrum was obtained 
by dispersion of white light incident normally upon a pair of 60° 
prisms in series. Within the resulting spectrum were placed, at 
judicious positions, three small right-angled prisms; these returned 
the three light beams of the primary wavelengths required back 
to two further 60° prisms, which neutralised the dispersion in the 
beams to give a single emergent beam brought by further prisms 
to one-half of a field of view in an eyepiece. To enable the amount 
of light in each of these three primary beams (before recombina- 
tion) to be varied, a neutral wedge was placed in each of them. 
The optical density of each wedge was varied depending on its 
position (regulated by a drum-drive) in the beam. 






The separate beam of light of a spectral colour to be matched 
against the mixture of primary constituents was arranged to illu- 
minate the neighbouring other half of the eyepiece field of view. 
To this separate beam could be added, as required, an adjustable 
amount of a desaturating primary colour — thus providing the 
negative stimulus values needed to enable any spectral colour to 
be matched. 

The Donaldson colorimeter (described in 1935) depends on a 
colour-mixing operation utilising an integrating sphere (Fig. 7.7). 




Fig. 7.7. The Donaldson colorimeter 



Light from the incandescent filament lamp S passes through three 
primary colour filters C (red, green and blue), then traverses the 
condensing lens L to form an image of the light source in the 
aperture A of the hollow integrating sphere B, which is internally 
coated with white magnesium oxide. The three primary beams 
are mixed by diffuse reflection at this inner surface of the sphere. 
Consequently, light of uniform colour (a mixture of the three 
primaries) emerges from a separate window W in the sphere. By 
means of the right-angled prism R and lens L₁, this light illuminates 
one-half of a Lummer-Brodhun photometer cube P (Section 6.10 
of Volume 1). 

The other half of the photometer cube is filled with light from 
the specimen X, which is illuminated by a standard C.I.E. lamp 
(Section 7.5). The lens L₂ and the objective lens O of the viewing 
telescope form an image of the specimen at the eyepiece E so that 
the structure of the specimen is not observed when a colour match 
is being made. To enable this match to be made, the amounts 
of the three primary coloured beams reaching the integrating 
sphere are varied by means of movable shutters over the colour 
filters C or by the use of adjustable neutral wedges. 

As always, for matching highly saturated colours, desaturation 
of the specimen colour (i.e. provision of a negative primary) is 
needed. This desaturation component light is provided by light 
from the source S reflected by the mirror M and the transparent 
plate N, where any one of the three primary colour filters (identical 
with those used at C) is inserted. These filters are mounted on a 
rotatable wheel T, and intensity variation is brought about by a 
circular neutral wedge Q, with a diffusing glass K within the beam. 

In the Eastman subtractive colorimeter, a sample colour is 
matched by light passing through three optical wedges in succes- 
sion ; the thickness of the wedges in the light beam depends on their 
adjustable position. The wedges are minus-red, minus-green and 
minus-blue, forming selectively absorbing material through which 
light is transmitted to match the sample colour. 



Exercise 7 

1. In relation to terminology in the science of colour, give definitions of hue, 
saturation, luminance and chromaticity. 

Explain with the aid of simple diagrams what is meant by: (a) additive 
colour mixing, and (b) subtractive colour mixing. 

2. Write an account of the tristimulus values used in additive colour match- 
ing and explain how negative tristimulus values may arise. 

3. Explain the physical reasons for the reddish colour appearance of the fol- 
lowing: (a) a piece of iron at a temperature of about 900°C, (b) a red dye- 
stuff; (c) the red region of a spectrum produced from white light by a prism ; 
(d) the Sun when near the horizon on a clear day ; (e) a discharge in neon gas ; 
(f ) the markings of the wings of certain butterflies. 

4. Write an essay on the C.I.E. system in colour measurement. The X, Y 
and Z tristimulus values for a source of light are 20 lm, 30 lm and 50 lm 
respectively. What are the trichromatic coefficients of this source? 

5. Describe in detail one form of trichromatic colorimeter and explain how 
it is used to determine a colour match for a coloured object which is 
illuminated by a standard C.I.E. lamp. 



CHAPTER 8 



The Microscope 



8.1 Introduction 

The principles of the compound microscope have been described 
in Volume 1. The limit of resolution (Equation 12.7 of Volume 1) 
is given by 

$$h = \frac{0.61\lambda}{n \sin U} \tag{8.1}$$

where h is the least separation between object points which can 
give rise to images distinguishable as separate in the field of 
view, n is the refractive index of the medium in which the object 
is immersed, U is the semi-angle at the vertex of the cone of rays 
diverging from the object to reach the objective lens, and λ is the 
wavelength of the radiation. 

In arriving at Equation 8.1, it is assumed that the object is 
self-luminous and that points on it to be distinguished emit inco- 
herent radiation. Though this is true for a telescope being used to 
view stars, for example, it is usually untrue for a microscope 
because the object is frequently in the form of a specimen sand- 
wiched between a microscope slide and a cover glass. The illumi- 
nation is from a lamp with a sub-stage condenser, which focuses 
light from the lamp at the plane of the object. In such circumstances, 
the limit of resolution is not usually decided by Equation 8.1 but 
by a modified equation 

$$h = \frac{0.5\lambda}{n \sin U} \tag{8.2}$$

which arises from the Abbe theory of the microscope. Before 
this theory is dealt with, it is necessary to describe the sub-stage 
condenser systems employed. 


8.2 Sub-stage Condenser Lenses 






The purpose of a sub-stage condenser is to provide brilliant illu- 
mination at a selected small area of the microscope slide in order 
that the field of view in the microscope eyepiece is sufficiently 
bright when high magnification is used. 

Two methods are used commonly: critical illumination and 
Kohler illumination. In the former, the source of light S [Fig. 8.1(a)] 
is an opal lamp or an illuminated diffusing screen. An image of S 






Fig. 8.1. Sub-stage microscope condenser arrangements : (a) critical illumination; 
(b) Kohler illumination 



is formed by the sub-stage condenser lens L at the object (i.e. a 
small brilliant patch of light is formed in the plane of the microscope 
slide). Two iris diaphragms are usually incorporated: D₁ to 
vary the aperture of the sub-stage condenser lens; and D₂ to 
control the area of the object which is illuminated so that details 
of interest in the object can be 'pin-pointed' and light from other 






parts of the object can be excluded from the microscope objective 
lens. 

In Kohler illumination [Fig. 8.1(b)], two lenses are used to form 
the sub-stage condenser. The field lens L₁ forms an image of the 
source S (often a lamp filament) in the first focal plane of the 
condenser lens L₂. The light emerging from L₂ is consequently 
collimated. These emergent collimated beams will be at angles to 
the axis dependent upon the angles of the incident light from L₁. 
The object is placed at the intersection of these emergent collimated 
beams, of which the extreme obliquity can be altered by an iris 
diaphragm D₁ placed to define the cone of light incident upon L₂ 
from L₁. A second iris diaphragm D₂ is placed so that its image 
is formed in the plane of the object and it thus enables the area of 
the object illuminated to be controlled. Kohler illumination gives a 
greater brightness of light spot at the object than does critical illu- 
mination, and is preferred, especially for high magnification. 

8.3 The Abbe Theory of the Microscope and Its Limit 
of Resolution 

Objects viewed in microscopy are not usually self-luminous but are 
illuminated by light from a source focused by a sub-stage condenser. 
The light scattered by the object into the objective is therefore 
partly coherent, particularly if Kohler illumination is practised. 
In the fuller theory of the microscope, it must consequently be 
taken into account that the incident light is diffracted both by the 
object and the objective, and not by the latter alone. 

To deal with an object having detail of any kind of geometry is 
theoretically very difficult. Abbe simplified the consideration by 
discussing an object in the form of a regular grating and illuminated 
by a coherent plane wave from the condenser; this gives a useful 



Fig. 8.2. Abbe theory of resolution of a microscope objective for an object in the 
form of a grating 

approximation to several object specimens which have a fairly 
regular structure. To obtain a specification of the limit of resolution 
for an object in the form of a grating, Abbe advanced the prin- 
ciple that the microscope objective must be able to collect the 
direct beam and at least one of the beams in the first order of 
diffraction. 

Suppose the sub-stage condenser produces at the grating object 
a beam incident at angle i in a medium of refractive index n, and 
that after diffraction by the grating the light emerges at an angle θ₁ 
in the first order of diffraction into a medium of refractive index 
n' (Fig. 8.2). If e is the separation between neighbouring slit centres 
in the grating, it follows from the well-known equation for a 
diffraction grating (Equation 11.20 of Volume 1) that, in the first 
order for light of wavelength λ, 

$$e(n' \sin\theta_1 - n \sin i) = \lambda \tag{8.3}$$

For the direct beam, Snell's law of refraction applies, so 

$$n' \sin\theta = n \sin i \tag{8.4}$$

where θ is the angle at which the undiffracted light emerges from 
the grating. 

Applying the Abbe principle that the direct beam and one of 
the first order diffracted beams from the object must be able to 
enter the objective lens, it is clear that the limit of resolution 
occurs when the direct beam enters the objective lens at one extrem- 
ity of its diameter and the first order beam enters this lens at the 
other extremity. Consequently, 

$$\theta = -\theta_1 \tag{8.5}$$

and θ must also equal U, the semi-angle at the vertex of the cone 
of rays entering the objective. 
From Equations 8.3 and 8.4, 

$$e(n' \sin\theta_1 - n' \sin\theta) = \lambda$$

Applying Equation 8.5 and taking magnitudes, 

$$2en' \sin\theta = 2en' \sin U = \lambda$$

Therefore 

$$e = \frac{0.5\lambda}{n' \sin U} \tag{8.6}$$

where n' sin U is the numerical aperture (N.A.) of the microscope 
objective, and e is the smallest separation between adjacent lines 
in the grating which can be discerned. 




Equation 8.6 is the Abbe equation for the limit of resolution of 
a microscope; it is preferred to Equation 8.1 if the object viewed 
is not self-luminous but is illuminated by a sub-stage condenser 
lens. As the angle θ can be reduced by varying the obliquity of the 
incident light from the sub-stage condenser on to the object, it 
follows that the numerical aperture of this condenser also decides 
the limit of resolution unless it is large enough to ensure that the 
limit to θ is the angle U set by the microscope objective alone. 
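
As a rough numerical illustration, the short Python sketch below evaluates the limit of resolution for self-luminous points (factor 0.61, as in Equation 8.1) and the Abbe limit (factor 0.5, Equation 8.6) for the same objective. The function names and the sample values (a numerical aperture of 1.25 and light of wavelength 5,500 Å) are assumptions chosen for illustration only.

```python
def resolution_limit(wavelength, numerical_aperture, factor):
    """Least resolvable separation: factor * wavelength / (n sin U)."""
    if numerical_aperture <= 0:
        raise ValueError("numerical aperture must be positive")
    return factor * wavelength / numerical_aperture


def self_luminous_limit(wavelength, numerical_aperture):
    """Limit for self-luminous (incoherent) object points, factor 0.61."""
    return resolution_limit(wavelength, numerical_aperture, 0.61)


def abbe_limit(wavelength, numerical_aperture):
    """Abbe limit for an object illuminated through a condenser, factor 0.5."""
    return resolution_limit(wavelength, numerical_aperture, 0.5)


# Assumed sample values; the result is in the same unit as the wavelength.
wavelength_angstrom = 5500.0
numerical_aperture = 1.25

print("self-luminous limit (A):", self_luminous_limit(wavelength_angstrom, numerical_aperture))
print("Abbe limit (A):", abbe_limit(wavelength_angstrom, numerical_aperture))
```

Because both limits are proportional to the wavelength, halving the wavelength halves the least resolvable separation for a given numerical aperture.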



8.4 Amplitude and Phase Objects 

In microscopy, the object viewed often has variations of optical 
density and colour. Such an object is called an amplitude object. 
Alternatively, sometimes the object is transparent and so does not 
exhibit density and colour variations, but has an optical thickness 
which varies from point to point over its surface. Such a phase 
object will introduce phase changes in the light that traverses it, but 
no optical density or colour variations. The eye is incapable of dis- 
cerning phase changes, though it is obviously able to respond to any 
density or colour variations. 

In normal use, therefore, the microscope is unable to examine 
structural changes in phase objects. As such changes are often of 
importance, the techniques of dark-ground illumination and also 
phase-contrast microscopy have been introduced to enable them 
to be discerned. 



8.5 Dark-ground Illumination and Phase Contrast 

The basic idea involved in both dark-ground illumination and 
phase-contrast techniques is illustrated by Fig. 8.3. Let A, B and C 
be three points out of the several thousands that may be involved in 
a transparent phase object. The light from the sub-stage condenser 
incident on the object will have its electric vector in a given direction 
at a given point such as A at a given time. On traversing the phase 
object at A, this electric vector will suffer a phase change depending 
on the optical path difference through the object at A. In Fig. 8.3, 
suppose the vector AX represents the amplitude and phase of the 
electric vector in the light emerging at A. At the same instant of 
time, the corresponding electric vectors at B and C are represented 
by BY and CZ. The vectors will all have the same length if the object 
is uniformly illuminated and is uniformly transparent, but their 
angles of setting will be different because of the different optical 
thicknesses of the object at points A, B and C. The eye is unable to 
discern any differences between A, B and C. 







Fig. 8.3. Principle involved in converting a phase object into an amplitude object 

Suppose that, by some means, light of constant amplitude and 
phase at a given time is added to the existing light at each of the 
points A, B and C. For convenience, let the vectors representing 
these added disturbances be horizontal in Fig. 8.3 and represented 
by the lines AD, BE and CF, all of equal lengths. The resultant vectors 
R₁, R₂ and R₃ at A, B and C respectively are now clearly of 
different magnitudes, and the corresponding intensities I₁, I₂ and 
I₃ are also different. These different intensities can be discerned 
by the eye, so variations in the optical thickness of the phase object 
are effectively displayed. 

This concept can be expressed simply in mathematical form on 
the premise that the light at time t from such a point as A may be 
expressed as a sin ωt, where a is the amplitude of its electric vector 
and the light is monochromatic of frequency ω/2π. At point B, the 
expression concerned will be a sin (ωt + α), where α is the phase 
angle between the electric vectors at A and B. If it is now assumed that 
light represented by b sin (ωt + β) is added at A and at B, the 
resultant effects are determined by the following expressions: at A 

$$a \sin\omega t + b \sin(\omega t + \beta)$$

and at B 

$$a \sin(\omega t + \alpha) + b \sin(\omega t + \beta)$$

For simplicity, put a = b = 1; then at A 

$$\sin\omega t + \sin(\omega t + \beta) = 2\cos\frac{\beta}{2}\,\sin\left(\omega t + \frac{\beta}{2}\right)$$

and at B 

$$\sin(\omega t + \alpha) + \sin(\omega t + \beta) = 2\cos\frac{\alpha - \beta}{2}\,\sin\left(\omega t + \frac{\alpha + \beta}{2}\right)$$

Hence at A the light now has an amplitude proportional to cos (β/2), 
whereas at B the amplitude is proportional to cos [(α − β)/2]: 
a phase object is therefore converted into an amplitude object. 
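
The result can be checked numerically by treating each disturbance as a complex phasor. The sketch below is illustrative only: the function name `resultant_amplitude` and the sample phase angles are assumptions, and unit amplitudes (a = b = 1) are taken as in the text.

```python
import cmath
import math


def resultant_amplitude(object_phase, added_phase):
    """Amplitude of sin(wt + object_phase) + sin(wt + added_phase) for unit amplitudes."""
    return abs(cmath.exp(1j * object_phase) + cmath.exp(1j * added_phase))


beta = math.radians(60.0)    # phase of the added light (assumed value)
alpha = math.radians(100.0)  # phase at B relative to A (assumed value)

amplitude_at_a = resultant_amplitude(0.0, beta)
amplitude_at_b = resultant_amplitude(alpha, beta)

# Compare the phasor sums with the closed forms 2 cos(beta/2) and 2 cos((alpha - beta)/2).
print(amplitude_at_a, 2 * abs(math.cos(beta / 2)))
print(amplitude_at_b, 2 * abs(math.cos((alpha - beta) / 2)))
```

The two points, which had equal amplitudes before the supplementary light was added, now differ in amplitude and hence in intensity.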

It remains to consider how the required supplementary light of 
constant amplitude and phase can be provided. In the dark-ground 
method, the same disturbance is subtracted from every point such 
as A, B and C. The usual technique for this is to remove that light 
which passes undiffracted from the object through the microscope 
objective; thus the direct or zero order light is subtracted, leaving 
only diffracted light to form the image. This direct light from a small 







Fig. 8.4. Sub-stage condenser for dark-ground illumination 

region of the illuminated object is of constant, or nearly constant, 
amplitude and phase at a given time. It can be removed by placing 
a stop in the sub-stage condenser aperture (or in the microscope 
objective), or by employing other means so that no direct light 
traverses the objective (Fig. 8.4). The image in the field of view of 
the object is hence formed by light in the first and higher orders of 
diffraction, and the object appears bright against a dark back- 
ground. 



8.6 Phase-contrast Microscopy 

In phase-contrast techniques, a phase shift is introduced in part of 
the light at each point in the field of view. The direct (zero order) 
light is changed in phase relative to the diffracted light and not 
subtracted from the image-forming light as in dark-ground illumi- 
nation. The phase-contrast microscope was introduced in 1935 by 
Zernike. A simplified version of part of his description is of interest. 
In Fig. 8.5, let the amplitudes at a given time of the electric vec- 
tors in the light in the image plane due to points A, B and C in 
a phase object be OA, OB and OC respectively; these are conve- 
niently drawn from a common origin O and of equal length, so A, B 
and C are on the circumference of a circle of which O is the centre. 






Fig. 8.5. The principle of phase contrast 

For convenience, it is assumed that the optical thickness (and so 
the phase retardation) increases progressively from A to B to C. 
Suppose the vector OD represents the contribution which the direct 
(undiffracted) light makes at each point in the image plane. In phase 
contrast, the phase of this direct light is shifted relative to that of 
the diffracted light. Let this shift be an advance through 90°. 
Light of the same amplitude as the direct light, but leading it in 
phase by 90°, will be represented in the diagram by the vector OD₁ 
of the same length as OD. 

Consider the effect of shifting through +90° the direct part of the 
light which contributes to the image at point A. This is equivalent to 
removing from the light that due to direct illumination and substitu- 
ting in its place the same amount of light advanced in phase by 90°. 
The removal of the direct light corresponds to subtracting the vector 
OD (which is equivalent to moving the origin from O to D), so that 
at A the effect would be represented by a vector DA. The substitution 
of the light advanced by 90° would then be equivalent to moving 
the point A downwards by the distance OD₁: but this is equivalent 
to leaving A where it is and instead shifting the origin from D to 
O₁, a vertical distance OD₁ above D in the diagram. 

It follows that the resultant amplitudes and settings of the electric 
vectors at the given time in the light disturbances in the image plane 
due to points A, B and C when the positive 90° phase shift has been 
introduced are represented by vectors O₁A, O₁B and O₁C respectively. 
As the optical thickness of the phase object increases progressively 
from A to B to C, the amplitude of the light in the image plane 
decreases from O₁A to O₁B to O₁C. The thicker parts of the phase 
object therefore give rise to the smaller intensities in positive phase 
contrast (where the phase shift introduced is positive), which is gen- 
erally convenient. It is also possible to use negative phase contrast, 






in which the phase shift is negative, say −90°; the thicker parts of 
the phase object will then give rise to the greater light intensities in 
the field of view. 

In Fig. 8.5, dark-ground illumination would simply correspond to 
the transference of the origin from O to D. This illustrates a diffi- 
culty with dark-ground illumination, in that points placed symmet- 
rically about OD (such as B and B') would give equal resultant 
intensities decided by (DB)² and (DB')². Dark-ground illumination 
is consequently unable to determine whether the phase is advanced 
by a certain amount owing to a decrease in thickness of the phase- 
object specimen, or retarded by the same amount owing to a corre- 
sponding increase in thickness. Phase-contrast, on the other hand, 
is able to do so. For points B and B', with positive phase-contrast of 
90°, the intensities are considerably different as they are decided 
by (O₁B)² and (O₁B')². Again, objects are frequently of mixed amplitude 
and phase types. For a point on such an object where the 
absorption is significant, the origin will not be D in dark-ground 
illumination but E, for example. Intensities observed in the field 
of view may now be due to amplitude diminution either because 
of absorption or because of phase. With dark-ground methods, in 
some instances the effects of absorption cannot be distinguished 
from the effects of optical thickness. This second difficulty also 
exists with phase-contrast methods if the phase is ±90°, but it can 
be overcome if a shift of phase angle rather less than 90° is used. 
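
The vector construction of Fig. 8.5 can be imitated with complex phasors. The following is a minimal sketch under stated assumptions, not a model of a real instrument: each object point is given a unit-amplitude phasor, the direct light OD is taken as a real amplitude `direct` (assumed equal to 1), a positive phase denotes an advance, and the function names are hypothetical.

```python
import cmath
import math


def dark_ground_intensity(phase, direct=1.0):
    """Intensity when the direct light (vector OD) is removed from the image."""
    return abs(cmath.exp(1j * phase) - direct) ** 2


def phase_contrast_intensity(phase, direct=1.0, shift_deg=90.0):
    """Intensity when the direct light is replaced by a copy shifted in phase by shift_deg."""
    shifted = direct * cmath.exp(1j * math.radians(shift_deg))
    return abs(cmath.exp(1j * phase) - direct + shifted) ** 2


phi = math.radians(20.0)  # assumed phase advance; -phi is an equal retardation
for label, p in (("advanced", phi), ("retarded", -phi)):
    print(label, dark_ground_intensity(p), phase_contrast_intensity(p))
```

In this sketch dark-ground illumination gives the same intensity for the advanced and the retarded point, whereas positive phase contrast gives the retarded (thicker) point the smaller intensity; passing `shift_deg=-90.0` reverses the contrast.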

To introduce the necessary phase difference between the direct 
illumination (zero order) and the first and higher orders, a phase 
plate is used. A common arrangement is to place an annular aper- 
ture in the first focal plane of the sub-stage condenser lens with 
Kohler illumination, and the phase plate in the second focal plane 
of the microscope objective lens (Fig. 8.6). This phase plate consists 
of a disc of glass in which is cut an annular depression which is an 
exact counterpart of the annular aperture. The direct light through 
the annular aperture then traverses the annulus of the phase plate 
and suffers there a smaller path retardation than the remaining 



Fig. 8.6. Annular aperture and phase plate in a phase-contrast microscope 



diffracted light. The phase of the direct light is therefore not retarded 
so much as (or rather it is advanced with respect to) the diffracted 
light, giving a positive phase-contrast set-up. The phase advance 
introduced by the phase plate is easily seen to be (2π/λ)(n − 1)t, 
where n is the refractive index of the glass of the phase plate of 
thickness t, and λ is the wavelength of the light concerned. 
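
The expression for the phase advance is easily evaluated. The sketch below interprets t as the depth of the annular depression (an assumption made here) and uses assumed sample values of n = 1.5 and a wavelength of 5,461 Å; the function names are hypothetical.

```python
import math


def phase_advance(depth, refractive_index, wavelength):
    """Phase advance in radians, (2*pi/wavelength) * (n - 1) * depth."""
    return 2.0 * math.pi * (refractive_index - 1.0) * depth / wavelength


def depth_for_advance(advance_deg, refractive_index, wavelength):
    """Depth giving the stated phase advance; same length unit as the wavelength."""
    return math.radians(advance_deg) * wavelength / (2.0 * math.pi * (refractive_index - 1.0))


n_glass = 1.5                  # assumed refractive index of the phase plate
wavelength_angstrom = 5461.0   # assumed wavelength

depth = depth_for_advance(90.0, n_glass, wavelength_angstrom)
print("depth for a 90 degree advance (A):", depth)
print("check, advance in degrees:", math.degrees(phase_advance(depth, n_glass, wavelength_angstrom)))
```

For a 90° advance the expression reduces to t = λ/[4(n − 1)], so the required depth is a fraction of a wavelength and depends on the wavelength used.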



8.7 Interference Microscopes 

Various types of interference microscope have been developed, 
but only one model is described in outline here. The light is divided 
by some form of beam-splitter so as to traverse two paths, one of 
which is modified by the specimen. When the two optical paths 
differ by pλ/2, where p is an integer and λ is the wavelength of the 
monochromatic light used, the field of view in the microscope 
brought about by the subsequently reunited beams will be crossed 
by interference fringes of distribution and spacing related to the 
structure of the specimen being viewed. Fig. 8.7 shows the optical 







Fig. 8.7. An interference microscope 



paths in an interference microscope used to study surface structure by 
reflection. Light from the monochromatic source S is collimated by 
the lens L and divided by the beam-splitter B (which consists of 
a glass cube with a partially aluminised diagonal plane, the two 
halves of the cube being cemented together). Beam 1 is incident 
upon and reflected back by the specimen P after traversing the 
microscope objective O, whereas the other beam 2 travels along the 
perpendicular path to the compensating objective O' and is reflected 




back by the plane mirror M. The two reflected beams, one of which 
is modified by the surface of the specimen, are reunited to interfere 
along the direction of beam 3. Beam 3 reaches the eyepiece in 
the field of view, in which the specimen surface topography is 
displayed as an interference pattern. The dark or bright lines in 
this pattern will be at positions corresponding to path differences 
of λ/2 and will correspond to contour lines in the surface of the 
specimen at height intervals of λ/2. 



8.8 The Ultra-violet Microscope 

If ultra-violet radiation of wavelength, say, 2,500 Å is used instead 
of visible light at 5,000 Å, then h in Equation 8.1 or Equation 8.2 
will be halved for the same numerical aperture n sin U. The resolving 
power is therefore doubled. This advantage is not so significant in 
practice as the fact that many substances, particularly biological 
specimens such as the important nucleic acids, exhibit strong absorption 
in the ultra-violet between 2,400 Å and 4,000 Å although 
they are virtually transparent to visible light. Furthermore, signif- 
icant reflection and absorption phenomena which are of consider- 
able interest in the examination of structure occur in the ultra- 
violet in several specimens other than biological ones. 

The main problem in the design of an ultra-violet microscope is 
that glass optics cannot be used because of the absorption of ultra- 
violet radiation in glass. Fused quartz objectives (and also quartz- 
fluoride and quartz-lithium fluoride objectives with some degree of 
chromatic aberration correction) are used, but the chief advances 
in ultra-violet microscopy resulted from the advent of reflecting 
microscope objectives (Johnson, 1934). A two-mirror reflecting 



Fig. 8.8. A two-mirror reflecting microscope objective 

microscope objective (Fig. 8.8) consists of a concave mirror with 
a central aperture concentric with a smaller convex mirror. Such 
a system can be designed to be free of spherical aberration and 
coma. The numerical aperture is limited in this design to about 0.5 
because of the obstruction due to the small convex mirror. Burch 
designed a reflecting objective in which the concave mirror was 
aspherical, and this allowed the numerical aperture to be increased 
to 0.65 or more. To obtain significantly larger numerical apertures, 
such reflector systems are used in conjunction with a quartz- 
fluorite front-immersion lens. 

The source of ultra-violet radiation should preferably be substantially 
monochromatic, so it must produce well-spaced spectrum lines. Both 
the spark discharge between cadmium electrodes and the medium- 
pressure quartz-enveloped mercury lamp are employed. Filters are 
sometimes used to select the particular ultra-violet spectrum line 
of interest. These are either liquid filters in quartz parallel-sided 
containers or interference filters. 

Some instruments incorporate an eyepiece which is either a re- 
fracting type made from fused quartz or a reflecting design; in 
other designs, an eyepiece is not used. To record the image, it may 
be focused at a fluorescent screen, but as the brilliance is usually 
inadequate, the common practice is to record the image on a pho- 
tographic film or plate. 



Exercise 8 

1. Write an account of the Abbe theory of the limit of resolution of a micro- 
scope in which the object is in the form of a grating illuminated by mono- 
chromatic light. 

Calculate the minimum separation that could be observed between parallel 
lines in such a grating by a microscope with a numerical aperture of 0.7 if 
light of wavelength 5,000 Å were employed. 

2. Explain the physical principles involved in the use in microscopy of: 
(a) dark-ground illumination, and (b) phase contrast. What are the advan- 
tages of method (b) over method (a)? 

Draw a ray-path diagram showing clearly the positions in a phase-contrast 
microscope with Kohler illumination of the annular aperture and the com- 
plementary phase plate. 

3. Two of the most important instruments in science are the telescope and 
the microscope. Write an account of one of these describing (i) its basic 
design and the features necessary to give optimum performance and (ii) 
how it has been developed to cover wide wavelength ranges. (L.P.) 

4. Describe, with an outline diagram, an interference microscope suitable 
for examining by reflection the surface topography of a metallic specimen. 




5. What are the main reasons for the development of ultra-violet microscopes? 
Describe in outline a typical reflecting objective for an ultraviolet micro- 
scope and outline briefly the other main features of the design of such an 
instrument. 

6. Calculate the limit of resolution in observing self-luminous objects of the 
following microscopes: (a) an instrument with an objective of numerical 
aperture 0.85 utilising light of wavelength 4,500 Å; (b) an ultra-violet microscope 
with a reflecting objective of numerical aperture 0.65 where the ultra-violet 
radiation has a wavelength of 2,400 Å; and (c) an electron microscope 
with a numerical aperture of 0.01 in which the electrons are accelerated 
through a potential difference of 50 kV. 



CHAPTER 9 



Coherence, Optical Masers 
and Holography 



9.1 The Coherence of Electromagnetic Radiation 

From the point of view of coherence, a source of electromagnetic 
radiation should, ideally, give rise to infinitely long, continuous 
wavetrains which are in phase and where the frequency and ampli- 
tude of the vibrations are constant at all points in the source. Such 
a source has not been developed; the nearest approach is the optical 
maser (laser), which still falls considerably short of the ideal. 

Three chief types of incoherence are conveniently classified. 
The first results from the finite size of the source and is called 
spatial incoherence. Spatial incoherence is due to the fact that the 
wavefront does not emanate from an area of the source in which all 
points are in phase. A perfectly spatially coherent beam of radiation 
would thus be one in which all points in the wavefront were in 
phase. The second type of incoherence is known as temporal inco- 
herence and results from the finite lengths of pulses of radiation pro- 
duced by the source; in order to separate temporal incoherence 
from spatial incoherence as a phenomenon, a very small area of 
the source is specified. The third type is amplitude and frequency 
incoherence, which arises from perturbations in the source condi- 
tions. 



9.2 Temporal Incoherence 

To illustrate the effects of temporal incoherence, consider as an 
example Lloyd's single-mirror experiment on interference, already 




described in Section 8.5 of Volume 1. The two rays SP and SQP 
(Fig. 9.1) interfere at P to give rise to a fringe system of the 
'cosine-squared' type. The visibility V of these fringes is defined by 
the equation 

$$V = \frac{I_{\max} - I_{\min}}{I_{\max} + I_{\min}}$$

where I_max is the intensity at the centre of a given bright fringe, and 
I_min is the intensity at the centre of an adjacent dark fringe. 
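
The definition of visibility can be evaluated directly from two measured intensities. The sketch below is illustrative; the function name `visibility` and the sample intensities are assumptions.

```python
def visibility(i_max, i_min):
    """Fringe visibility V = (I_max - I_min) / (I_max + I_min)."""
    if i_min < 0 or i_max < i_min:
        raise ValueError("expected 0 <= i_min <= i_max")
    total = i_max + i_min
    if total == 0:
        raise ValueError("intensities must not both be zero")
    return (i_max - i_min) / total


# Assumed sample intensities in arbitrary units.
print(visibility(1.0, 0.0))  # perfectly dark minima
print(visibility(3.0, 1.0))  # partially washed-out fringes
print(visibility(2.0, 2.0))  # uniform illumination, no fringes
```

The value lies between zero (no fringes discernible) and unity (perfectly dark minima).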

The visibility V will depend on the coherence of the source of 
light S and may be used as a measure of this coherence for a given 
path difference Δ = SQP − SP. If it is postulated that the source is 





Fig. 9.1. The typical interfering rays in Lloyd's single-mirror experiment 

of a very small area in which the vibrations are of constant ampli- 
tude and frequency and the intensity distribution corresponds 
to a Dirac delta-function (see Appendix, page 366), the value of 
V will give a measure of the temporal coherence of the source. This 
is obvious in general terms, because the coherence at P depends 
on the degree of correlation between the electric field vectors due to 
the wavetrains SP and SQP. If the pulse of radiation from the source 
is shorter in length than the path difference Δ, the correlation will be 
zero and no coherence results. Generally, of course, the pulses have 
varying lengths and the visibility V will only have its maximum 
value of unity when the path difference Δ is zero because then all 
pulses give perfect correlation. As the value of the path difference 
Δ increases for a given length of pulse from the source, the visibility 
decreases. 

9.3 The Relation Between the Spectral Line Width of the Source and 
the Length of the Radiated Pulse 

The energy of the photon emitted by an excited atom or molecule is 
decided by the energy transition ΔE which an electron undergoes 
in moving from any energy level E₁ to a lower level E₂, where 
E₁ − E₂ = ΔE. The frequency ν of the corresponding radiation is 
given by 

$$\Delta E = h\nu$$

where h is Planck's constant. 






The intensity of the spectral line resulting depends on the number 
of electrons per second which make such transitions. In the quan- 
tum-mechanical treatment of atomic structure, it is shown that 
the electrons do not occupy specific orbits but have a probability 
of being at a given location decided by |Ψ|², where |Ψ| is the 
modulus of the complex amplitude of the wave function at the 
location in question (Section 2.15). The many electrons per second 
which make a transition from energy level E₁ to energy level E₂ do 
not therefore all undergo exactly the same energy change of ΔE, 
because each of the levels E₁ and E₂ has a certain natural width 
in accordance with the quantum-mechanical considerations. There 
is consequently a finite range of frequencies over the region 
±Δν around a given frequency ν, and a corresponding amplitude 
of the energy radiated expressible as f(ν) dν. This decides the natural 
width of the spectral line from an isolated atom or molecule. The Doppler 
effect (Section 5.8) will cause additional line broadening. 







Fig. 9.2. Curve of amplitude against frequency for light in a spectral line, 
indicating natural width 



For a source of very limited area, a given spectral line will 
therefore correspond to a small range of frequencies (Fig. 9.2). 
At a point such as P in the experiment illustrated by Fig. 9.1, a 
single pulse will now be a Fourier sum of sinusoidal components, 
so the amplitude of the electric vector in the radiation at P and 
at time t will be given by 



$$E(t) = \int_{-\infty}^{\infty} f(\nu) \exp(2\pi j \nu t)\, d\nu \tag{9.1}$$

The right-hand side of Equation 9.1 is the Fourier transform (see 
Appendix, page 362) of f(ν), and it must follow also that 

$$f(\nu) = \int_{-\infty}^{\infty} E(t) \exp(-2\pi j \nu t)\, dt$$



because the spectral amplitude distribution is the Fourier trans- 
form of the time-modulated amplitude at the point P. It can now 
be seen why a finite pulse is inevitably radiated in practice, because 







Fig. 9.3. Ideal source: (a) sinusoidal pulse modulated by a 'top-hat' waveform; 
(b) spectral line distribution 



an inverse relationship exists between the half-width of the spectral 
line amplitude distribution and the amplitude modulation at 
point P. 

The ideal source has a Dirac delta-function profile of which the 
Fourier transform is a constant giving a continuous sinusoidal 
monochromatic pulse. 

Fig. 9.3(a) shows a simple hypothetical example of such a 
source where a sinusoidal pulse of frequency ν₀ is modulated by a 
'top-hat' waveform to give the spectral line amplitude distribution 
shown in Fig. 9.3(b). 



COHERENCE, OPTICAL MASERS AND HOLOGRAPHY 329 

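Referring to Fig. 9.3(a),

$$f(\nu) = \int E(t)\,\exp(-2\pi j\nu t)\,dt = A\int_{-t_0}^{t_0} \exp(-2\pi j\nu t)\,dt$$

or

$$f(\nu) = -\frac{A}{2\pi j\nu}\left[\exp(-2\pi j\nu t_0) - \exp(2\pi j\nu t_0)\right]$$

so that

$$f(\nu) = \frac{A\sin 2\pi\nu t_0}{2\pi\nu t_0} = \frac{A\sin\alpha}{\alpha}$$

where $A$ is a constant (the factor $2t_0$ has been absorbed into it) and
$\alpha = 2\pi\nu t_0$.

The width of the spectral line is proportional to $1/t_0$. As the pulse
must begin at $t = 0$, the relationship between the pulse duration $t_0$
and the spectral frequency spread of $\Delta\nu_0 = 2\Delta\nu$ at the first zero
$\alpha = \pi$ is seen to be

$$t_0 = \frac{1}{\Delta\nu_0} \qquad (9.2)$$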
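The link between the top-hat pulse and its (sin α)/α spectrum can be checked numerically. The following is only a sketch under stated assumptions: the half-duration `t0` of 1 ns, the function names and the midpoint-rule integration are illustrative choices that do not come from the text, and ν is taken as the frequency offset from the carrier.

```python
import cmath
import math


def top_hat_spectrum(nu, t0, amplitude=1.0, steps=2000):
    """Midpoint-rule estimate of A * integral_{-t0}^{t0} exp(-2*pi*j*nu*t) dt."""
    dt = 2.0 * t0 / steps
    total = 0j
    for k in range(steps):
        t = -t0 + (k + 0.5) * dt  # midpoint of the k-th slice
        total += cmath.exp(-2j * math.pi * nu * t) * dt
    return amplitude * total


def sinc_spectrum(nu, t0, amplitude=1.0):
    """Closed form 2*A*t0*sin(alpha)/alpha with alpha = 2*pi*nu*t0."""
    alpha = 2.0 * math.pi * nu * t0
    if alpha == 0.0:
        return 2.0 * amplitude * t0
    return 2.0 * amplitude * t0 * math.sin(alpha) / alpha


t0 = 1.0e-9                    # hypothetical value, in seconds
first_zero = 1.0 / (2.0 * t0)  # offset at which alpha = pi
spread = 2.0 * first_zero      # full spread between first zeros, 1/t0

print("spectrum magnitude at first zero:", abs(top_hat_spectrum(first_zero, t0)))
print("numerical vs closed form at alpha = pi/2:",
      top_hat_spectrum(0.25 / t0, t0).real, sinc_spectrum(0.25 / t0, t0))
print("spread * t0 =", spread * t0)
```

The spectrum vanishes at the offset where α = π, and the product of the full spread and `t0` is unity, which is the content of Equation 9.2.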

Most spectral line profiles are Gaussian in shape,
giving rise to a pulse modulated by a Gaussian amplitude profile.
The optical maser may be regarded to a first approximation as
having a 'top-hat' modulation with a very broad $(\sin\alpha)/\alpha$ profile,
so approximating to the ideal.



9.4 Coherence Time and Coherence Length 

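An arbitrary definition of coherence time may be referred for
simplicity to the ideal situation illustrated by Fig. 9.3. From
Equation 9.2, the coherence time $t_0$ is $1/\Delta\nu_0$, and the coherence
length is defined as

$$l_0 = ct_0 = \frac{c}{\Delta\nu_0} \qquad (9.3)$$

where $c$ is the velocity of light. As $\nu_0 = c/\lambda_0$, where $\lambda_0$ is the
corresponding wavelength,

$$\frac{d\nu_0}{d\lambda_0} = -\frac{c}{\lambda_0^2}$$

Therefore

$$\Delta\nu_0 = -\frac{c}{\lambda_0^2}\,\Delta\lambda_0$$

and, numerically,

$$l_0 = \frac{c}{\Delta\nu_0} = \frac{\lambda_0^2}{\Delta\lambda_0} \qquad (9.4)$$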



Assume that a visible spectroscopic line has this ideal shape,
i.e. as in Fig. 9.3(b). A very good optical source of monochromatic
light would have a frequency spread $\Delta\nu_0$ of about $10^{10}$ Hz for
a frequency $\nu_0$ of $6 \times 10^{14}$ Hz corresponding to a wavelength of
5,000 Å. The coherence length $l_0$ would then be given by Equation
9.3 as

$$l_0 = \frac{c}{\Delta\nu_0} = \frac{3 \times 10^{10}}{10^{10}} = 3\ \text{cm}$$

True coherence for path differences exceeding 3 cm would therefore 
not be expected for this source. 

Optical masers (lasers) vary considerably in the stability of their
output. An ideal optical maser line and the corresponding pulse
shapes are shown in Fig. 9.4. The same relationship holds as in
Equation 9.2, because the profiles concerned are simply
interchanged. A continuous gas laser at room temperature will have a
short-term value of $\Delta\nu_0$ as small as 10 Hz. The corresponding value
of the coherence length $l_0$ is given by Equation 9.3 to be

$$l_0 = \frac{3 \times 10^{10}}{10} = 3 \times 10^{9}\ \text{cm}$$

For a pulsed ruby optical maser emitting light of wavelength
$\lambda_0 = 6{,}943$ Å, $\Delta\nu_0$ is approximately $5 \times 10^{6}$ Hz, so

$$l_0 = \frac{3 \times 10^{10}}{5 \times 10^{6}} = 6{,}000\ \text{cm}$$

which is still remarkable, though it must be remembered that these
calculations are for ideal lines.
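The three coherence lengths quoted above follow directly from Equation 9.3, and Equation 9.4 gives the same result from the wavelength spread. A minimal sketch, assuming c = 3 × 10¹⁰ cm/s as in the text; the function and dictionary names are illustrative only:

```python
C_CM_PER_S = 3.0e10  # velocity of light in cm/s, the rounded value used in the text


def coherence_time_s(delta_nu_hz):
    """Coherence time t0 = 1 / (frequency spread), Equation 9.2."""
    return 1.0 / delta_nu_hz


def coherence_length_cm(delta_nu_hz):
    """Coherence length l0 = c / (frequency spread), Equation 9.3."""
    return C_CM_PER_S / delta_nu_hz


def coherence_length_from_wavelength_cm(lambda0_cm, delta_lambda_cm):
    """Coherence length l0 = lambda0**2 / (wavelength spread), Equation 9.4."""
    return lambda0_cm ** 2 / delta_lambda_cm


# Frequency spreads quoted in the text for ideal line shapes
sources_hz = {
    "good conventional source": 1.0e10,
    "continuous gas laser (short term)": 10.0,
    "pulsed ruby laser": 5.0e6,
}

for name, delta_nu in sources_hz.items():
    print(f"{name}: l0 = {coherence_length_cm(delta_nu):.3g} cm")

# Cross-check of Equation 9.4 for the 5,000 angstrom line (5e-5 cm)
lambda0 = 5.0e-5
delta_lambda = lambda0 ** 2 * 1.0e10 / C_CM_PER_S
print(coherence_length_from_wavelength_cm(lambda0, delta_lambda))
```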



Fig. 9.4. Spectral line from an optical maser: (a) ideal spectral line of
width $\Delta\nu_0$; (b) corresponding pulse shape




9.5 Light Amplification by the Stimulated Emission of Radiation: 
the Laser Principle 

Let two energy levels $E_p$ and $E_q$ exist in a material, where $E_p > E_q$
and the populations of these levels are $N_p$ and $N_q$ per unit volume
of material respectively. If a transition takes place from $E_p$ to $E_q$,
the frequency $\nu$ of the photon ejected is given by

$$\nu = \frac{E_p - E_q}{h}$$

where $h$ is Planck's constant.

If this material is in thermal equilibrium at temperature $T$ °K,
the number of transitions per second from $E_p$ to $E_q$ is decided by
Equation 2.69, which gives the rate of decay of the excited state.
This number will be

$$AN_p + BU_T(\nu)N_p - BU_T(\nu)N_q$$

because $C = B$, $N_e = N_p$ and $N = N_q$; also, the number of
transitions is equal to $-dN_e/dt = -dN_p/dt$. Here, $A$ is the Einstein
coefficient for spontaneous emission, and $B$ is the Einstein coefficient
for absorption. The function $U_T(\nu)$ is the radiant energy per unit
volume at the temperature $T$.

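Each of these emitted photons has an energy $h\nu$, so the total
radiated energy per second per unit volume of material at the
frequency $\nu$ is given by

$$g(\nu) = [AN_p + BU_T(\nu)(N_p - N_q)]\,h\nu$$

Equation 2.73 states that

$$\frac{A}{B} = \frac{8\pi h\nu^3}{c^3}$$

Therefore

$$g(\nu) = Ah\nu\left[N_p + \frac{c^3}{8\pi h\nu^3}\,U_T(\nu)(N_p - N_q)\right] \qquad (9.5)$$

Now from Equation 2.64 with $g = 1$,

$$\frac{N_p}{N_q} = \exp\left(-\frac{E_p - E_q}{kT}\right) = \exp\left(-\frac{h\nu}{kT}\right) \qquad (9.6)$$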

where $k$ is Boltzmann's constant. As shown in Section 2.23,
$\exp(-h\nu/kT)$ is an exceedingly small number at optical frequencies
and temperatures of about 300°K. Under normal conditions,
therefore, $N_p \ll N_q$ and the right-hand side of Equation 9.5 is
negative, corresponding to $g(\nu)$ being negative; consequently,
absorption of photons occurs in the material, with the number of
spontaneous emissions exceeding greatly the number of stimulated
emissions.
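The size of the Boltzmann factor in Equation 9.6 is easily evaluated. The sketch below is illustrative: the rounded constants, the function name and the microwave comparison frequency of 10¹⁰ Hz are assumptions introduced here, while the optical frequency of 6 × 10¹⁴ Hz and the temperature of 300°K are the values used earlier in the chapter.

```python
import math

PLANCK_J_S = 6.626e-34       # Planck's constant (rounded)
BOLTZMANN_J_PER_K = 1.381e-23  # Boltzmann's constant (rounded)


def population_ratio(nu_hz, temperature_k):
    """Equilibrium ratio N_p / N_q = exp(-h*nu / (k*T)), Equation 9.6 with g = 1."""
    return math.exp(-PLANCK_J_S * nu_hz / (BOLTZMANN_J_PER_K * temperature_k))


optical = population_ratio(6.0e14, 300.0)    # visible light at room temperature
microwave = population_ratio(1.0e10, 300.0)  # hypothetical microwave comparison

print(f"optical   N_p/N_q = {optical:.3g}")
print(f"microwave N_p/N_q = {microwave:.4f}")
```

At optical frequencies the upper level is practically empty in thermal equilibrium, whereas at microwave frequencies the two populations are nearly equal.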

If, by some means, the electron populations can be inverted so
that $N_p > N_q$ then $g(\nu)$ becomes positive, as is seen from Equation
9.5. The number of stimulated emissions will then preponderate,
and the radiated power output from the material will be greatly
increased. This population inversion demands a pumping action
leading to optical maser (or laser) action. The term laser arises
from the initial letters of 'Light Amplification by the Stimulated
Emission of Radiation'. Lasing will be just possible when $N_p = N_q$;
from Equation 9.6 this requires $\exp(-h\nu/kT) = 1$, that is
$h\nu \ll kT$, a condition which is not approached at optical
frequencies in thermal equilibrium.

Presume that $N_p > N_q$, where $N_p$ and $N_q$ are the populations per
unit volume. Let radiation of intensity $I$ be incident on a slab of
material of thickness $dx$ and unit cross-sectional area. This
intensity $I$ is the energy per unit area per second, and it will be related
to the energy density of radiation $U(\nu)$ by

$$I = U(\nu)\,c \qquad (9.7)$$

where $c$ is the velocity of light.
If $N_p > N_q$, Equation 9.5 becomes approximately

$$g(\nu) = \frac{Ac^3}{8\pi\nu^2}\,U(\nu)(N_p - N_q)$$

which, making use of Equation 9.7, gives

$$g(\nu) = \frac{Ac^2}{8\pi\nu^2}\,I\,(N_p - N_q)$$

But $g(\nu)$ is the radiated energy per unit volume of the material at
the frequency $\nu$, and so for a thickness $dx$ of unit cross-section,
the radiated energy is $g(\nu)\,dx$. This must equal the change of $I$ by
$dI$. Therefore

$$dI = g(\nu)\,dx$$

so

$$\frac{dI}{dx} = \frac{Ac^2}{8\pi\nu^2}(N_p - N_q)\,I$$

The intensity $I$ emerging after a path of length $x$ in the material,
where the incident intensity is $I_0$, is therefore given by

$$I = I_0 \exp(\alpha x) \qquad (9.8)$$

where

$$\alpha = \frac{Ac^2(N_p - N_q)}{8\pi\nu^2} \qquad (9.9)$$

Note that Equation 9.8 is comparable with the usual expression
$I = I_0 \exp(-\mu x)$ for the absorption of radiation in a material of
linear absorption coefficient $\mu$, but with this absorption coefficient
negative. The laser action therefore amplifies the light intensity. For a
helium-neon gas laser, typical values of $\alpha$ in Equation 9.8 are
given in Table 9.1. In practice the Doppler effect tends to reduce this
gain.



Table 9.1.

TYPICAL VALUES OF α FOR A HELIUM-NEON GAS LASER

Wavelength (Å)        α per metre

 6,328                    4,342
11,523                   21,710
33,913                1,128,920
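Equations 9.8 and 9.9 can be expressed as two short functions. This is only a sketch: the function names and the gain coefficient of 0.1 per metre are hypothetical values chosen for illustration, not entries from Table 9.1, and the Einstein coefficient and populations passed to `gain_coefficient` are placeholders used solely to show how the sign of α follows the sign of the inversion.

```python
import math


def gain_coefficient(einstein_a, n_upper, n_lower, nu_hz, c=3.0e8):
    """alpha = A c^2 (N_p - N_q) / (8 pi nu^2), Equation 9.9."""
    return einstein_a * c ** 2 * (n_upper - n_lower) / (8.0 * math.pi * nu_hz ** 2)


def emergent_intensity(i0, alpha_per_m, path_m):
    """I = I0 exp(alpha x), Equation 9.8; negative alpha describes absorption."""
    return i0 * math.exp(alpha_per_m * path_m)


alpha_gain = 0.1  # hypothetical gain coefficient, per metre
for x in (0.0, 0.5, 1.0):
    print(f"x = {x:.1f} m: I/I0 = {emergent_intensity(1.0, alpha_gain, x):.4f}")

# Placeholder numbers: only the sign of the result is of interest here
print(gain_coefficient(1.0, 2.0, 1.0, 1.0) > 0)  # inverted population: amplification
print(gain_coefficient(1.0, 1.0, 2.0, 1.0) < 0)  # normal population: absorption
```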



9.6 The Laser as an Optical Oscillator 

Population inversion is demanded in the laser. If provided, it is
seen from the previous section that the laser acts as an amplifier
analogous to an electronic amplifier. Pursuing this analogy, the
requirements of an oscillator developed from an amplifier are
positive feedback and a high Q-factor for sharpness of resonance.

In 1958, Schawlow and Townes suggested that the Fabry-Perot 
interferometer could be used as an optical oscillator if a suitable 
gaseous medium was placed between the Fabry-Perot plates in 
which laser action could be established. The idea is that the beam 
of light to be amplified in intensity (generated originally by an 
electron transition between energy levels in a discharge set up 
in the gas) is partly reflected at one of the Fabry-Perot mirrors to 
provide the positive feedback and totally reflected at the other 
mirror. The amplified output light beam emerges through the partly 
reflecting mirror. 

In the simplest example, the Fabry-Perot etalon comprises two 
small flat mirrors A and B accurately parallel to each other — one 
at each end of a cylindrical tube 1 m long containing the gas. 
An arrangement of this kind (Fig. 9.5) can be thought of as a
resonant cavity, analogous to a sonic cavity, containing a standing
wave system of wavelength $\lambda$ where

$$n\lambda = 2d$$

$n$ being an integer and $d$ the plate separation.
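The standing-wave condition fixes which wavelengths the cavity supports. As an illustrative sketch, the functions below find the order n nearest to a given wavelength and the frequency spacing c/2d between adjacent axial modes; the spacing is not stated in the text but follows from the condition on writing ν = nc/2d. The names are hypothetical, and the values d = 1 m and 6,328 Å are those used elsewhere in the chapter.

```python
C_M_PER_S = 3.0e8  # velocity of light in m/s (rounded)


def nearest_mode_number(d_m, wavelength_m):
    """Integer n closest to satisfying n * wavelength = 2 * d."""
    return round(2.0 * d_m / wavelength_m)


def mode_wavelength_m(d_m, n):
    """Wavelength of the axial mode of order n."""
    return 2.0 * d_m / n


def axial_mode_spacing_hz(d_m):
    """Frequency difference between orders n and n + 1, equal to c / (2 d)."""
    return C_M_PER_S / (2.0 * d_m)


d = 1.0               # plate separation in metres
wavelength = 6.328e-7  # 6,328 angstroms in metres

n = nearest_mode_number(d, wavelength)
print("order n:", n)
print("mode wavelength (m):", mode_wavelength_m(d, n))
print("axial mode spacing (Hz):", axial_mode_spacing_hz(d))
```

The very large value of n shows why many modes of nearly equal wavelength fit within a cavity of this length.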

A fraction of the energy forms the standing wave system, and 
the remaining fraction is the output from the partly reflecting 
mirror B in Fig. 9.5. A large number of modes of the same and 
different wavelengths would be supported, because d is large and 
an infinite number of reflections are possible. If such a system were 




Fig. 9.5. Optical standing wave system between Fabry-Perot mirrors in the
cavity of a gas laser (labels: lasing medium (gas); mirror separation d;
laser output)

used as a laser, much of the energy would be wasted among unde- 
sirable modes. The large mirror separation confines the energy to 
primarily one axial mode; other modes of the same wavelength 
give energy lost through the sides of the cavity and hence not 
transmitted through the end mirror B. Such non-axial modes rep- 
resent a diffraction loss of energy. Axial modes of various wave- 
lengths could be transmitted, but in practice are unlikely to be 
generated because the wavelengths concerned are decided by the 
discrete energy levels present in the gaseous medium. As non-axial 
modes are lost, the resulting axial mode is highly directional. 
Ideally, it is assumed that over one end mirror, a uniform electric 
field exists in the radiation at a given time, giving rise to a uniform 
field at the other end mirror. In practice, diffraction losses occur 
and the beam spreads slightly from its perfect plane wavefront 
shape as it travels to and fro between the mirrors. This leads to 
an electric field distribution which falls off towards the outside of 
the plates, so the emergent beam diverges slightly. 

In the ideal situation, the laser beam emerges from the plate B as
a perfectly coherent wavefront with a constant electric field
amplitude at the exit of this plate. If this plate B is a circular disc of
radius $a$, substantially all the energy lies within the angle $\delta\theta$ (Fig.
9.6) where

$$\delta\theta = \frac{1.22\lambda}{2a}$$

Fig. 9.6. Laser light which is emerging from a circular end plate (the plate
mirror at the emergent end of the laser cavity) and is subject to
Fraunhofer diffraction

For example, for such a laser employing a helium-neon mixture
as the gas, at a wavelength of 6,328 Å with end plates of diameter
$2a = 1$ cm,

$$\delta\theta = \frac{1.22 \times 6.328 \times 10^{-5}}{1} = 8 \times 10^{-5}\ \text{rad approx.}$$

In practice, the fall-off of the electric field towards the edge of
the end plates is equivalent in its effect to a smaller aperture than
one of diameter $2a$, leading to an increased value of $\delta\theta$.
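The diffraction-limited spread for the helium-neon example can be checked with a few lines. This is a sketch assuming the usual Fraunhofer result for a circular aperture, δθ = 1.22λ/2a; the function name is illustrative, and the conversion to seconds of arc is added only for convenience.

```python
import math


def diffraction_spread_rad(wavelength, aperture_diameter):
    """Angular spread 1.22 * wavelength / diameter (both in the same unit)."""
    return 1.22 * wavelength / aperture_diameter


wavelength_cm = 6.328e-5  # 6,328 angstroms expressed in cm
diameter_cm = 1.0         # end plate diameter 2a

spread_rad = diffraction_spread_rad(wavelength_cm, diameter_cm)
spread_arcsec = math.degrees(spread_rad) * 3600.0

print(f"spread = {spread_rad:.3g} rad = {spread_arcsec:.1f} seconds of arc")

# A smaller effective aperture, as caused by the field fall-off, widens the beam
print(diffraction_spread_rad(wavelength_cm, 0.5 * diameter_cm) > spread_rad)
```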



9.7 The Line Width of the Output from a Fabry-Perot Type 
Cavity Laser 

The intensity $I$ of the light transmitted by a Fabry-Perot etalon is
given by Equation 4.15 as

$$I = \frac{I_{max}}{1 + F\sin^2(\delta/2)} \qquad (9.10)$$

where $F = 4r/(1 - r)^2$, $r$ being the reflection coefficient, and
$\delta = (2\pi/\lambda)(2d\cos\theta)$. In these equations, $I_{max}$ is the intensity at the
central maximum in the interference pattern, $I$ is the intensity at
a neighbouring position for light different in phase by $\delta$, $\theta$ is the
angle of incidence of the light at the etalon, and $\lambda$ is the wavelength.
Equation 9.10 is deduced on the assumption that an infinite
number of reflections occur at the end plates, and it is only an
approximate result when the value of $d$ is large compared with the plate
dimensions. At $I = 0.5\,I_{max}$,

$$1 + F\sin^2\frac{\delta}{2} = 2$$

Therefore

$$\sin^2\frac{\delta}{2} = \frac{1}{F}$$

and

$$\sin\frac{\delta}{2} = \frac{1}{\sqrt{F}}$$

As $\delta$ is small,

$$\frac{\delta}{2} = \frac{1}{\sqrt{F}} = \frac{1 - r}{2\sqrt{r}}\ \text{approx.}$$

A change of $\delta$ by $2\pi$ corresponds to a change in order by unity.
Therefore

$$\frac{\delta}{2\pi} = \delta p_0$$

where $\delta p_0$ is the change in order $p_0$ at the centre corresponding to $\delta$.
Therefore

$$\frac{\delta p_0}{p_0} = \frac{\delta/2\pi}{p_0} = \frac{1 - r}{2\pi p_0\sqrt{r}} \qquad (9.11)$$

As the frequency $\nu$ of the radiation is given by

$$\nu = \frac{c}{\lambda}$$

where $c$ is the velocity of light,

$$p_0 = \frac{2d}{\lambda} = \frac{2d\nu}{c}$$

Therefore

$$\delta p_0 = \frac{2d\,\delta\nu}{c}$$

so, numerically,

$$\frac{\delta p_0}{p_0} = \frac{\delta\nu}{\nu}$$

Hence, from Equation 9.11,

$$\delta\nu = \frac{(1 - r)\nu}{2\pi p_0\sqrt{r}}$$

The half-width of the frequency band transmitted by the
Fabry-Perot etalon is therefore given by

$$2\,\delta\nu = \Delta\nu = \frac{(1 - r)c}{2\pi d\sqrt{r}} \qquad (9.12)$$

because $\nu = c/\lambda$ and $p_0 = 2d/\lambda$.




For a reflection coefficient $r = 0.97$ and $d = 1.0$ m,

$$\Delta\nu = \frac{0.03 \times 3 \times 10^{8}}{2\pi\sqrt{0.97}} = 1.5\ \text{MHz approx.}$$

which result is independent of the frequency $\nu$. If, therefore, a gas
laser 1 m long with reflection coefficient effectively 0.97 at the end
mirrors can be constructed, the half-width of the frequency band
is nominally $1.5 \times 10^{6}$ Hz, which is very small at optical
frequencies of the order of $10^{14}$-$10^{15}$ Hz.
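The figure of about 1.5 MHz can be reproduced from Equation 9.12, and the small-angle step used in deriving it can be tested against Equation 9.10. A minimal sketch, with illustrative function names and c rounded to 3 × 10⁸ m/s as in the text:

```python
import math


def finesse_coefficient(r):
    """F = 4r / (1 - r)^2, as defined with Equation 9.10."""
    return 4.0 * r / (1.0 - r) ** 2


def etalon_half_width_hz(r, d_m, c=3.0e8):
    """Half-width of the transmitted band, (1 - r) c / (2 pi d sqrt(r)), Equation 9.12."""
    return (1.0 - r) * c / (2.0 * math.pi * d_m * math.sqrt(r))


r = 0.97  # reflection coefficient from the worked example
d = 1.0   # mirror separation in metres

delta_nu = etalon_half_width_hz(r, d)
print(f"half-width = {delta_nu / 1.0e6:.2f} MHz")

# Check of the small-angle step: with delta = 2 / sqrt(F) the denominator
# of Equation 9.10 should be very close to 2 (half the maximum intensity).
F = finesse_coefficient(r)
delta = 2.0 / math.sqrt(F)
denominator = 1.0 + F * math.sin(delta / 2.0) ** 2
print(f"1 + F sin^2(delta/2) = {denominator:.5f}")
```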

The line width of the spontaneous emission from a gas laser is,
in fact, governed chiefly by the Doppler effect (Section 5.8) and is of
the order of $10^{9}$ Hz for a temperature of 200°-300°C typically.
It can be shown that the line width $\Delta\nu$ of the stimulated emission is
related to the line width $\Delta\nu_s$ of the spontaneous emission by the
relation

$$\Delta\nu \propto \frac{(\Delta\nu_s)^2}{P} \qquad (9.13)$$

where P is the power level of the system. This gives a typical value 
in the visible region of 

A s 

where s is an integer and P is the power output in watts. 

For a gas laser, the threshold power is of the order of a milliwatt, 
which gives rise to an output of the order of a watt. The corre- 
sponding order of the line widths thus varies between 10 3 Hz and 
1 Hz. These are instantaneous values; long-term values may not
be so good because of instability. It is, however, possible to mix
the frequencies of the outputs from two independent lasers and
control the resonator lengths to within $10^{-3}$ Å, so giving a
frequency drift of less than 30 Hz in the interference pattern.



9.8 Optical Pumping 

Though the general idea of the gas laser has been described, it has 
been assumed so far that population inversion has been obtained. 
In fact, the basic problem in the use of any active material in which 
laser action is to be supported is to invert the population between 
the levels of a transition so that the probability of stimulated 
emission exceeds greatly that of spontaneous emission. This poses 
problems in the method of excitation and in the energy levels 
chosen. 






The easiest way to invert the population between two energy 
levels E p and E q would be to irradiate the material with photons 
from an external source of frequency v equal to that corresponding 
to the transition, i.e. where v = (E p —E q )/h. However, this radiation 
would inevitably give rise to stimulated emission from the upper to 
the lower level, as well as vice versa. The result would be the estab- 
lishment of an equilibrium of population densities. This difficulty 
can be overcome to some extent by making the radiation discontin- 
uous; stimulated emission is then possible, but with very limited 
amplification. 



Fig. 9.7. Stimulated emission from a three-energy-level system, where
metastable level $E_2$ becomes densely populated (labels: incident radiation
of frequency $\nu_{13} = (E_3 - E_1)/h$; spontaneous emission of frequency
$\nu_{32} = (E_3 - E_2)/h$; metastable level; stimulated emission of frequency
$\nu_{21} = (E_2 - E_1)/h$)

A better system involves the use of three energy levels of which
the intermediate one is metastable and so has a considerably longer
lifetime than the outer levels. In Fig. 9.7, three energy levels $E_1$, $E_2$
and $E_3$ exist in the material, where $E_3 > E_2 > E_1$, and $E_2$ is
metastable. The incident radiation from an external source is of
frequency $\nu_{13}$ where

$$h\nu_{13} = E_3 - E_1$$

so that it raises electrons from level $E_1$ to level $E_3$. The electrons
in the $E_3$ level relax rapidly to the $E_2$ level and in so doing give rise
to spontaneous emission of radiation of frequency $\nu_{32}$ given by

$$h\nu_{32} = E_3 - E_2$$

As $E_2$ is a comparatively long-lived metastable level, electrons
accumulate at this level. When this process has proceeded for a
sufficient time, an incident beam of radiation of frequency $\nu_{21}$
from an external source where

$$h\nu_{21} = E_2 - E_1$$

will give rise to stimulated emission between the levels $E_2$ and $E_1$;
the population of level $E_2$ may also then exceed that of level $E_1$,
because level $E_1$ has been robbed of electrons which go temporarily
to level $E_3$ whereas accumulation of electrons is significant at level
$E_2$.

This three-level system is particularly advantageous if the
transition from level $E_1$ to level $E_3$ corresponds to a broad spectral line,
whereas that from $E_2$ to $E_1$ corresponds to a narrow line. The light
used for optical pumping (to raise electrons from $E_1$ to $E_3$) may
then have a broad spectral range, while the spectral range of the
stimulated emission is narrow.
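The build-up of population in the metastable level of the three-level scheme can be illustrated with a simple rate-equation model. The sketch below is not taken from the text: the rate constants are hypothetical and in arbitrary units, stimulated emission on the E₂ to E₁ transition is neglected, and the equations are stepped forward with the elementary Euler method. It shows only that the population of E₂ overtakes that of E₁ once the pumping rate exceeds the slow decay rate of the metastable level.

```python
def simulate_three_level(pump_rate, decay_32=100.0, decay_21=1.0,
                         dt=1.0e-3, steps=20000):
    """Euler integration of the populations (n1, n2, n3), normalised to sum to 1.

    pump_rate: rate of pumping between levels 1 and 3 (hypothetical units)
    decay_32:  fast relaxation rate from level 3 to the metastable level 2
    decay_21:  slow spontaneous decay rate from level 2 to level 1
    """
    n1, n2, n3 = 1.0, 0.0, 0.0  # all atoms start in the ground level
    for _ in range(steps):
        pump = pump_rate * (n1 - n3)  # net upward pumping, 1 -> 3
        relax_32 = decay_32 * n3      # rapid relaxation, 3 -> 2
        relax_21 = decay_21 * n2      # slow decay of metastable level, 2 -> 1
        n1 += dt * (relax_21 - pump)
        n2 += dt * (relax_32 - relax_21)
        n3 += dt * (pump - relax_32)
    return n1, n2, n3


weak = simulate_three_level(pump_rate=0.5)
strong = simulate_three_level(pump_rate=5.0)

print("weak pumping:   n1 = %.3f, n2 = %.3f" % (weak[0], weak[1]))
print("strong pumping: n1 = %.3f, n2 = %.3f" % (strong[0], strong[1]))
```

With the weaker pump the ground level remains the more populated; with the stronger pump the metastable level holds most of the atoms, which is the inverted condition required for laser action.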



Fig. 9.8. Stimulated emission from a four-energy-level system (labels:
incident radiation; spontaneous emission to the metastable level;
stimulated emission; spontaneous emission of frequency $\nu_{21}$)



Fig. 9.8 illustrates a four-level system which has the advantage
of producing a population inversion more quickly than the
three-level system, although it still retains the advantages of a three-level
system. The metastable level is now $E_3$, so the probabilities of the
spontaneous transitions $E_4$ to $E_3$ and $E_2$ to $E_1$ are much greater
than $E_3$ to $E_2$. Hence level $E_3$ fills up with electrons, as in the
three-level system, when radiation of frequency $\nu_{14}$ is incident. This is
because relaxation is rapid from $E_4$ to $E_3$, and $E_2$ empties rapidly
owing to the transition $E_2$ to $E_1$ as the incident radiation of
frequency $\nu_{32}$ gives rise to stimulated emission from $E_3$ to $E_2$. The
absorption line $E_2$ to $E_3$ will be sharp, and the absorption line $E_1$ to $E_4$
broad.

Materials with three or four energy levels suitable for optical 
pumping initiated by incident light of appropriate frequency 
include crystals, glasses, plastics, organic liquids and semiconductor 
materials. Doped solid-state crystalline materials are particularly 
suitable as they have broad absorption energy bands. Optical pump- 
ing is not generally suitable for pure gases because the absorption 
energy levels are too narrow; gas mixtures, such as helium-neon, 
are excited usually by other means than optical pumping. 






9.9 The Ruby Laser 

An important example of a solid-state laser with a
three-energy-level system in which optical pumping is used is the ruby laser.
Ruby consists of aluminium oxide (Al₂O₃) in which 0.01-0.05%
of the Al³⁺ ions have been replaced by Cr³⁺ ions. It is only these
chromium ions which take part in the laser action.

The energy levels in ruby which participate in the laser action 
are shown in Fig. 9.9. Level 1 is the ground state, the two close 
levels 2 are metastable energy levels, and levels 3 are in a broad 
energy band. 

On the incidence of green light on the ruby, absorption takes 
place because of transitions from the ground state to the broad 
band of levels 3. From this excited band, electrons fall to the doub- 
let levels 2, which are metastable and so have a long lifetime. 



Fig. 9.9. Energy levels for chromium ruby showing transitions involved in optical
pumping, spontaneous and stimulated emissions (labels: energy band 3;
metastable levels 2, which become densely populated; transition due to
absorption of green light (optical pumping); level 1, ground state;
stimulated emission at 6,943 Å (strong) and at 6,929 Å (weak))



As the broad band 3 can absorb a good deal of energy from the
incident green light, the transitions to levels 2 bring about a high
population in levels 2, greater than that of the ground state. The
laser emission is primarily from the lower of the two levels 2 to
ground; this is red light at a wavelength of 6,943 Å at room
temperature. Transitions to the ground state also occur from the
upper of the two levels 2, giving a wavelength of 6,929 Å at room
temperature. This light is less intense than that at 6,943 Å, because



Fig. 9.10. Pulsed ruby laser (schematic diagram; labels: ruby rod; partly
silvered plane mirror; laser output beam; fully silvered plane mirror;
helical electronic flashlamp)

the upper level of the doublet levels 2 is less populated than the 
lower one. 

The practical arrangement of a ruby laser (Fig. 9.10) consists
of a cylindrical ruby rod of 2-20 cm in length and 0.1-2 cm in
diameter, with optically polished plane-parallel ends. One end is
fully reflecting and the other end is partly reflecting. This ruby
rod is surrounded by a helical xenon lamp which is flashed by the
discharge of capacitors through it. The pulsed ruby laser gives a
laser output at a wavelength of 6,943 Å of about 0.2 J for an input
energy from the xenon lamp of about 5,000 J. Continuously
operated ruby lasers are also made in which the flashlamp output
must be intensely focused by making use of internal reflectors in a
small cone-shaped ruby rod.

Fig. 9.11. Elliptical reflector which will focus light from a strip electronic
flashlamp at a cylindrical laser

Optical pumping is usually initiated by high-pressure xenon
lamps giving a flash which lasts for 0.5-2 ms. Elliptical reflectors
have been used (Fig. 9.11) with the flashlamp at one focus and the 
laser at the other. Unless the power supplied by the incident light 
is very high and well above the threshold pumping power, the 
output of laser light will be discontinuous or 'spiked' as a result 
of temporary relapses in the state of population inversion produced 
by the flashlamp. 



9.10 Gas Lasers 

Several different types of gas laser have been made. The first to be
operated successfully consisted of a mixture of helium at a pressure
of approximately 1 mmHg and neon at a pressure of approximately
0.1 mmHg; it was designed by Javan and his colleagues in 1961.
A more recent version of this tube (Fig. 9.12) consists of this gas



Fig. 9.12. A d.c. operated helium-neon gas laser (labels: mirrors; laser
beam; L.T. supply)



mixture within a cylindrical pyrex tube with optically polished glass
end plates set at the Brewster angle $i_B$. Outside the Brewster
windows are the Fabry-Perot plane-parallel mirrors of silica, each
coated with multiple dielectric layers to give a reflection coefficient
of 98-99% and about 0.3% transmission. It is convenient for end
windows to be set at the Brewster angle to the tube, because they
enable one linearly polarised component of the light to undergo
very small reflection loss. The Fabry-Perot mirrors can then be
outside the tube with little loss of this polarised light in the Brewster
windows. These mirrors can be set in mounts for ready adjustment
in 'tuning' the gas laser to optimum output. The laser tube itself
is about 70 cm long, and a laser beam emerges at both ends.
Fig. 9.12 illustrates a d.c. operated tube with a coated-filament 






source of electrons at one end and an anode at the other. High- 
frequency (30 MHz) operated discharges are also used (with the 
advantage that the electrodes can be exterior to the laser tube), 
but d.c. operated gas lasers seem to have a longer lifetime. 
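The benefit of setting the end windows at the Brewster angle can be made concrete with the Fresnel reflection formulae. The sketch below is illustrative: the refractive index of 1.5 for the window glass is an assumed value not given in the text, the function names are hypothetical, and Brewster's law (tan i_B = n) and the Fresnel expressions are the standard results for a single air-glass surface.

```python
import math


def brewster_angle_deg(n_window, n_outside=1.0):
    """Brewster angle from tan(i_B) = n_window / n_outside."""
    return math.degrees(math.atan2(n_window, n_outside))


def fresnel_reflectances(incidence_deg, n_window, n_outside=1.0):
    """Intensity reflectances (perpendicular, parallel) at a single surface."""
    theta_i = math.radians(incidence_deg)
    theta_t = math.asin(n_outside * math.sin(theta_i) / n_window)  # Snell's law
    cos_i, cos_t = math.cos(theta_i), math.cos(theta_t)
    r_perp = (n_outside * cos_i - n_window * cos_t) / (n_outside * cos_i + n_window * cos_t)
    r_par = (n_window * cos_i - n_outside * cos_t) / (n_window * cos_i + n_outside * cos_t)
    return r_perp ** 2, r_par ** 2


n_glass = 1.5  # assumed refractive index of the window glass
i_b = brewster_angle_deg(n_glass)
refl_perp, refl_par = fresnel_reflectances(i_b, n_glass)

print(f"Brewster angle = {i_b:.2f} degrees")
print(f"reflectance, perpendicular component = {refl_perp:.4f}")
print(f"reflectance, parallel component      = {refl_par:.2e}")
```

The component polarised in the plane of incidence passes the window with essentially no reflection loss, while the other component loses an appreciable fraction at every surface, so only the former can build up in the cavity.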

For the helium-neon gas laser, a power input from the gaseous 
discharge of about 50 mW is required — much lower than for 
optically pumped solid-state lasers — and the operation is contin- 
uous. 

The energy levels appropriate to the operation of this gas laser
are shown in Fig. 9.13. The neon gas is the active agent although
it is present at a pressure of only 0.1 mmHg; the helium, at 1 mmHg,



Fig. 9.13. Energy levels involved in the operation of the helium-neon gas laser
(labels: energy axis; neon 3s (four levels); 6,328 Å transition; neon 2p
(10 levels); neon ground state)



is necessary for energy transfer. The pumping in this kind of gas 
laser is not optical but is by electron impact. The direct current 
through the gas (or the excitation by a high-frequency electric 
field) creates the necessary electrons in the gas discharge. These
electrons excite the helium atoms to the metastable $2^1S$ and $2^3S$
levels. Collisions of the excited helium atoms with the neon atoms
result in a transfer of energy to the neon atoms. The close multiplet 
energy levels 3s and 2s of electrons in the neon thereby become 
populated. The probability of direct excitation of these levels in 
neon is very small; it is therefore necessary to populate them by 
energy transfer from the excited helium atoms, otherwise the 
mixture of gases used would be pointless. 

These 3s and 2s electron levels of neon are metastable and so have
comparatively long lifetimes. The lower 2p levels of neon, on the
other hand, are not long-lived, so depopulate very rapidly to the
1s levels. Population inversion is consequently obtained between
3s and 2p, and also between 2s and 2p. The stimulated emission
responsible for the laser light is consequently 3s → 2p and 2s → 2p.
The transition 2p → 1s gives incoherent radiation, which is the
characteristic reddish neon discharge light. The large number of
lines in the stimulated emissions (the 3s and the 2s levels are both
quadruplets, whilst the 2p consists of 10 close levels) gives rise to a
corresponding multiplicity of laser lines. Isolation of particular
lines in the gas laser output is therefore a problem of mode control,
dependent upon the Fabry-Perot arrangement of the mirrors
(Section 9.6). The wavelengths at which the helium-neon gas laser
operates are 6,328 Å and in the range 10,000-15,000 Å (1-1.5 μm)
with the strongest radiation at 11,523 Å.

Other gases which have been used in laser systems are pure
noble gases, and mixtures of noble gases with either oxygen, carbon
monoxide, carbon dioxide, nitrogen or some other gases. Optical
pumping has been achieved in a caesium-filled vapour laser with a 
helium flashlamp. The use of a mixture such as a noble gas with 
oxygen excited in a high-frequency discharge operates by virtue 
of dissociation of the oxygen into its constituent atoms when 
subjected to the impact of the excited noble gas atoms. One of the 
oxygen atoms remains in an excited state from which it radiates 
stimulated emission lines. 



9.11 Semiconductor or Injection Lasers 

Semiconductor materials can be doped to produce an excess or a 
deficiency of electrons. Some lattice positions in a suitable crystalline 
material can be occupied by atoms containing more electrons than 
those in the other positions in the lattice. This process gives an 
n-type semiconductor. The well-known examples are silicon and
germanium, both quadrivalent intrinsic semiconducting materials.
If silicon (or germanium) is doped with a very small percentage of 
pentavalent impurity (such as arsenic, phosphorus or antimony), 
the donor atoms in the lattice will provide excess electrons because 
five valence electrons are available per atom as against four for 
silicon. 

The opposite process occurs in which some lattice sites of the
semiconductor material are occupied by atoms containing fewer
electrons: this leads to a p-type semiconductor containing positive
holes. An example is silicon (or germanium) doped with a trivalent 
impurity such as indium, aluminium or boron. At lattice locations 
where an indium atom is present instead of a silicon one, only 
three valence electrons are available instead of four. The positions 
of electron deficits are the positive holes. 

If a junction is made between a p-type and an n-type semicon- 
ductor, the application of a p.d. of the correct polarity will cause a 
flow of electrons from the n-region to the p-region. The electron 
surplus in the n-region occupies an energy band called the conduc- 
tion band; the p-region of electron deficiency occupies the valence 
energy band. These two bands are broad, because each band 
comprises a continuous energy spread over a fairly wide region. 




To obtain laser action, stimulated emission is demanded, and so
also is a high population in a metastable level ready for transition
to a comparatively empty and non-metastable state. The process
in semiconductors is very complicated and not fully understood.
In outline the probable operation is as follows. A high pulsed
current (about 1,000 A cm$^{-2}$) is passed through a semiconductor
junction from n-material to p-material. A large number of
electrons may then be visualised as passing into the p-region and
establishing themselves in a virtually metastable state. From this virtual
state they fall to the valence level with the emission of coherent
photons. The active region is very narrow: about $10^{-4}$ cm wide
on the p-side of the junction.

An example of this process is in the gallium arsenide laser. 
Tellurium is added to replace some of the arsenic atoms and create 
an n-region, whereas zinc is added to replace some of the gallium 
atoms to form a p-region. Typical dimensions are 0.1 mm × 0.1 mm ×
1.0 mm in the final cut and polished crystal (Fig. 9.14).



Fig. 9.14. Outline of a gallium arsenide injection laser (labels: bevelled
side surfaces; active region; back and front are parallel polished emitting
surfaces; base electrode (gold plated))

Semiconductor laser materials can have high efficiencies of 
about 50%, as compared with 5% for a helium-neon gas laser. 
High-power outputs with extreme brightness are possible. A signif- 
icant feature as compared with other laser methods is that modu- 
lation of the light output is possible. 



9.12 Q-switched or Pulsed Lasers 

It is possible to increase considerably the output of a laser by a
process known as Q-switching. Substantial stimulated emission
is delayed until a much greater population inversion than usual is
obtained. Stimulated emission is then allowed to take place
suddenly, resulting in a pulse of coherent light of considerably increased
intensity suitable, for example, for optical welding. In one method,
the beam is interrupted by a mechanical shutter or rotating prism
placed between the end of the lasing material and one of the
mirrors. The shutter is connected to the flashlamp mechanism, so
that the flash occurs whilst the shutter is closed. Lasing cannot
take place until the shutter opens; meanwhile, the population
inversion builds up to its peak value. When the shutter opens the
energy is released in a giant pulse.

Table 9.2.

Source material and excitation        Output wavelengths (Å)

Gases (electron impact pumping)       He-Ne (continuous): 6,328; 11,532; 33,900
                                      He-Ne (pulsed): 11,177-12,066
Solid state (optical pumping)         Ruby (pulsed): 6,943
                                      Neodymium (pulsed): 10,600
Carbon arc as typical optical source

Further columns: angular divergence (sr); electric field in focused spot
(V cm$^{-1}$).

In another method, a bleachable solution is interposed in place 
of the movable shutter. For example, a cell containing phthalo- 
cyanine is used with a ruby laser. This cell absorbs strongly light 
of the ruby output frequency. Lasing cannot occur in the ruby 
until a substantial inversion has built up. Coherent light is initially 
faintly transmitted and bleaches the solution, which then suddenly 
becomes transparent allowing the passage of a large light pulse. 
The molecules in the solution then revert to their normal state, 
so the solution becomes red and the process repeats itself. 



9.13 Application of Lasers 

The account of optical masers or lasers that has been given is 
merely a survey of the more common lasing materials and methods 
of excitation, and is by no means comprehensive. The comparative 
properties of the output from the various materials used are sum- 
marised in Table 9.2. 

The marked advantages of lasers lie in applications requiring a 
high degree of coherence in the output radiation, for example in 
holography. An obvious limitation is the number of frequencies at 
present available in the ultra-violet region of the spectrum. Stability 
of the output is also a problem in many applications. The appli- 
cations are numerous. Broadly, they fall into five main divisions : 

1. Measuring techniques taking advantage of the high coherence 
path length and, in some cases, the short pulse time and high 
intensity of the laser beam; 

2. Modulation and mixing; 

3. Power applications; 

4. Communications; 

5. Coherent-light image and data processing. 

9.13.1 MEASURING TECHNIQUES

A good example of the application of a laser to measuring tech- 
niques is the use of the Twyman and Green interferometer (Section 
10.4 of Volume 1) for testing optical components. With ordinary
sources of light, the collimation requirements are very stringent;
the use of a laser beam leads to simplification and increased accu- 
racy. A typical set-up for testing a prism is shown in Fig. 9.15: it 
gives, in the focal plane of the camera, a two-dimensional Fourier 
distribution corresponding to the phase modulation of the reference 
beam by the beam returning from the prism under test. 




Fig. 9.15. Arrangement for using a laser source to test a prism in a Twyman and
Green interferometer (diagram labels: MIRROR, CAMERA)



Distance measurements in surveying can be undertaken by using 
a laser pulse of light and adopting the principle utilised in radar 
of determining electronically the time interval between the outgoing 
and returning pulses of light. 
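
The radar principle described above reduces to halving the product of the velocity of light and the measured out-and-back delay. A minimal sketch follows; the function name and the sample delay are illustrative assumptions, not taken from the text.

```python
C = 2.99793e8  # velocity of light in free space (m/s), as tabulated in this book

def range_from_round_trip(delay_s):
    """Distance to the target from the out-and-back delay of a light pulse."""
    if delay_s < 0:
        raise ValueError("delay must be non-negative")
    # The pulse covers the distance twice, hence the factor of one half.
    return C * delay_s / 2.0

# Hypothetical measured delay of 6.67 microseconds
print(f"{range_from_round_trip(6.67e-6):.1f} m")
```

A timing error dt gives a range error C*dt/2, so resolving one metre needs timing to a few nanoseconds.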

Ruby lasers and Q-switched lasers have facilitated several devel- 
opments in high-speed photography. 

The laser is not generally used as a primary frequency standard, 
because the laser frequency is chiefly determined by the cavity 
dimensions. It is, however, useful as a relative frequency standard. 
The output from a given laser can be combined with some other 
optical frequency (also from a laser), and the difference found to
within a few hertz. It is probable that, with improved cavity tuning,
the laser will become sufficiently stable to form a primary frequency 
standard. 

There are also many possible applications in spectroscopy. The 
most obvious one is the exploration of the Raman effect, which 
presents difficulties with standard optical sources; indeed, the 
science of Raman spectroscopy is currently undergoing a revolution 
following the advent of the laser. 



9.13.2 POWER APPLICATIONS 

In power applications, a Q-switched laser beam is focused to a 
spot of about five wavelengths diameter, giving typically outputs 
up to megawatts at pulse rates up to 100 per second. Such intense 
pulses of light may be used for making very small holes in metals 
and other materials; for the local volatilisation of small quantities 
of material; for welding in engineering; for welding in position a 
detached eye retina; and, possibly, even for providing a sufficiently 
great highly localised temperature to allow a nuclear fusion (thermo- 
nuclear) reaction to proceed. 
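
To give a feel for the magnitudes involved, the sketch below estimates the intensity and the peak electric field in a spot five wavelengths in diameter. It assumes a uniformly filled circular spot and the free-space plane-wave relation I = ½cε₀E²; the 1 MW peak power and the ruby wavelength are assumed inputs, not figures quoted in the text.

```python
import math

C = 2.99793e8       # velocity of light (m/s)
EPS0 = 8.85424e-12  # permittivity of free space (F/m)

def focused_spot(power_w, wavelength_m, diameter_in_wavelengths=5.0):
    """Return (intensity in W/m^2, peak field in V/m) for a uniform circular spot."""
    radius = diameter_in_wavelengths * wavelength_m / 2.0
    intensity = power_w / (math.pi * radius ** 2)
    # Plane wave in free space: I = (1/2) * c * eps0 * E_peak**2
    e_peak = math.sqrt(2.0 * intensity / (C * EPS0))
    return intensity, e_peak

# Assumed inputs: 1 MW peak power at the ruby wavelength of 694.3 nm
intensity, e_peak = focused_spot(1e6, 694.3e-9)
print(f"I = {intensity:.2e} W/m^2, E = {e_peak / 100.0:.2e} V/cm")
```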

The focused beam will also probably be used in the future for 
the reading and writing of optical information; this will require a 
means of modulating precisely the position and intensity of the 
beam so as to produce data storable in a suitable material. 

9.13.3 COMMUNICATIONS 

Laser beams have obvious advantages over microwaves and con- 
ventional light sources in the field of communications: laser beams 
can be directed much more accurately, and have greater monochro- 
maticity and, potentially, greater frequency bandwidths. It is pos- 
sible, for example, to record a pulsed laser beam, returning after 
reflection at the Moon's surface, over an area of some two square 
miles. This illustrates dramatically the spatial coherence properties 
of the beam. 

As an example of the potential information-carrying possibilities 
of the laser beam, $10^{10}$ channels each 10 kHz wide (corresponding
to the usual audiofrequency range used in radio for modulating
a carrier wave) could be accommodated on a carrier wave of
frequency $10^{14}$ Hz. However, modulation of light beams from a laser
is a difficult problem. The modulation could be superimposed at the 
generating lasing material or after leaving the laser. Electro-optical 
devices such as the Kerr cell appear to be the most promising for 
such modulation. 
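
The channel count quoted above is simply the carrier frequency divided by the channel width. The sketch below reproduces it, on the assumption that the usable bandwidth is as large as the carrier frequency itself, which is an upper bound rather than a practical figure.

```python
def channel_count(carrier_hz, channel_width_hz):
    """Upper bound on the number of channels, taking bandwidth = carrier frequency."""
    return carrier_hz // channel_width_hz

print(channel_count(10**14, 10**4))
```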




9.13.4 IMAGE AND DATA PROCESSING 

The essential of image and data processing is the formation of 
a Fourier transform intensity distribution. For example, in Fig. 
9.16, a Fourier transform of the two-dimensional transparency is 
formed in the second focal plane of the lens 1. This hologram (Sec- 
tion 9.14) may be treated in any way desired to remove, for example, 
offensive background detail. Alternatively, a spatial filter placed at 
the hologram position will give an image formed by lens 2, as 
shown in Fig. 9.16, with the features removed as desired.

Fig. 9.16. Outline showing the formation of a hologram of an object in the form
of a transparency with incident laser light (diagram labels: LASER LIGHT,
TRANSPARENCY, LENS 1, HOLOGRAM, LENS 2, IMAGE)

Radio photographs usually have a background forming a regular
two-dimensional lattice of spots. The hologram then shows a similar
lattice with inverse dimensions. The larger scale means that a spa- 
tial filter can easily be made to remove this undesirable lattice fea- 
ture from the hologram. 
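
The filtering idea can be tried numerically in one dimension: transform the record, blank the orders belonging to the regular lattice, and transform back. The sketch below uses a plain discrete Fourier transform in place of the lenses; the signal, the lattice period and the function names are illustrative assumptions.

```python
import cmath
import math

def dft(samples, inverse=False):
    """Plain discrete Fourier transform (O(n^2)); stands in for a lens."""
    n = len(samples)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        total = sum(samples[m] * cmath.exp(sign * 2j * math.pi * k * m / n)
                    for m in range(n))
        out.append(total / n if inverse else total)
    return out

def remove_lattice(samples, lattice_cycles):
    """Zero the two spectral orders at +/- lattice_cycles, then transform back."""
    n = len(samples)
    spectrum = dft(samples)
    spectrum[lattice_cycles % n] = 0
    spectrum[-lattice_cycles % n] = 0
    return [value.real for value in dft(spectrum, inverse=True)]

N = 64
picture = [math.cos(2 * math.pi * m / N) for m in range(N)]              # coarse detail
lattice = [0.5 * math.cos(2 * math.pi * 16 * m / N) for m in range(N)]   # fine lattice
cleaned = remove_lattice([p + l for p, l in zip(picture, lattice)], 16)
print(max(abs(c - p) for c, p in zip(cleaned, picture)))  # residual after filtering
```

Because the fine lattice maps to orders far from the centre of the transform, a simple mask removes it without touching the coarse detail.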



9.14 Holography 

Suppose a plane wavefront of monochromatic light is incident 
upon an object. Light will be reflected (scattered) by the object, or 
transmitted if the object is transparent though it may contain re- 
gions of various optical densities. If this reflected or transmitted light 
is focused by a lens to form an image, as in a camera, the image is 
a two-dimensional replica of the object. If a lens is not used (i.e. 
a 'lensless' camera is used), implying that the light as modulated by 
the object falls directly upon a photographic plate or film, a diffrac- 
tion pattern is recorded. This recorded diffraction pattern contains 
only information about the intensities in the diffracted light; it does 
not contain information about the phase. To put this in another 
way, using the concepts of Section 6.23, a Fraunhofer diffraction 
pattern is, to a first approximation, the Fourier transform of the
diffracting aperture (the object in the present case). The record of
the diffraction pattern will give no information about the phase of 
the Fourier transform. 

If this phase information could be recorded also, the record of 
the light from the object would then contain all the information 
which goes to make up the image of the object. This data is present 
in the light, irrespective of whether or not a lens is used to form the 
usual image. To obtain in the record the necessary phase informa- 
tion and yet use a 'lensless' camera, the technique adopted is to 
direct into such a camera monochromatic light which has been 
modulated by the object, and at the same time provide monochro- 
matic light of the same kind to enter the lensless camera directly. 
The photographic record is then of the light modulated by the ob- 
ject added to which is reference unmodulated light of the same 
wavelength. 

A photographic transparency which is obtained in this way then 
has an optical density variation decided by both the intensity and 
the phase of the modulated light from the object. This photo- 
graphic record is called a hologram. 

A beam of monochromatic light is then transmitted through 
the hologram. A second diffraction pattern is thereby generated. 
A component of this second pattern on a screen reproduces an 
image of the original object on the screen. 

Holography requires a source of radiation with a high degree of 
coherence and stability in time. Both these requirements are satis- 
fied by the laser. Moreover, the intensity of the laser light output 
enables the hologram to be recorded in a suitable short time, so 
reducing the effect of any movement. 

9.15 The Recording Process in Holography 

As an example of holography, collimated monochromatic light 
from a laser is divided into two beams. One of these beams enters 
the lensless camera directly: in Fig. 9.17(a) it is incident on the aper- 
ture OA at which there is a photographic plate. The other beam is 
incident upon the object and gives rise to a scattered beam modu- 
lated in a manner dependent upon the structure of the object. This 
scattered light also enters the aperture OA, and the hologram re- 
quired is recorded on the photographic plate at OA. 

Suppose the reference collimated beam 1 of light of wavelength
$\lambda$ is incident normally at the aperture OA whereas the modulated
beam 2 of light of the same wavelength is incident at a small angle
$\phi$ [Fig. 9.17(b)]. The amplitude of the electric vector in the incident
beam 1 will be constant over the aperture OA. Let this amplitude
be $E_0$.



Fig. 9.17. The principle involved in holographic recording (diagram labels in
panel (a): BEAM-SPLITTER, REFERENCE BEAM OF LIGHT, OBJECT, LIGHT MODULATED BY
OBJECT, PHOTOGRAPHIC PLATE IN APERTURE OA RECORDS HOLOGRAM; further label: BEAM 1)



Assuming first that the wavefront in beam 2 is also plane, the
amplitude over ON drawn perpendicular to XA will be constant.
For any point P in the aperture with coordinate $x$ referred to the
origin O, beam 2 will have a phase difference relative to beam 1
decided by $(2\pi/\lambda)\,\mathrm{MP}$, where $\mathrm{MP} = x \sin\phi \approx x\phi$, as $\phi$ is small.
The complex amplitude of the electric vector $E_2$ at a point such as
P in the aperture OA will therefore be represented by

$$E_2 = E_{02} \exp(jkx) \tag{9.14}$$

where $k = 2\pi/\lambda$.

However, beam 2 is modulated by the structure of the object.
Consequently, $E_{02}$, the amplitude of $E_2$ at $x = 0$, will be a function
of $x$, taking into account only variations in the object in the
$x$-direction (this assumes one dimension for simplicity; in practice, two
or three dimensions are concerned), and therefore $E_{02}$ is put as
$E(x)$. Moreover, $\phi$ will vary with $x$, because the angle $\phi$ depends on
the direction of beam 2 which will also alter with the object
structure. Hence $\phi$ is written as $\phi(x)$. The appropriate expression in place
of Equation 9.14 for a beam 2 modulated by the object is therefore

$$E_2 = E(x) \exp[j\phi(x)] \tag{9.15}$$

The intensity of the light due to the superposition of the reference
beam 1 and the modulated beam 2 in the plane of the aperture OA
is therefore given by

$$I(x) = \text{constant} \times |E_0 + E_2|^2 = \text{constant} \times \{|E_0|^2 + |E(x)|^2 + 2|E_0|\,|E(x)| \cos[kx + \phi(x)]\} \tag{9.16}$$
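
Equation 9.16 can be checked numerically by adding the two complex amplitudes directly and comparing with the expanded form. The sketch below takes the phase of beam 2 relative to beam 1 as kx + φ(x), as in Equation 9.16, and sets the constant of proportionality to unity; the numerical values are arbitrary assumptions.

```python
import cmath
import math

def hologram_intensity(e0, e_mod, k, x, phi):
    """|E0 + E2|^2 with E2 = e_mod * exp(j(kx + phi)); E0 is real and constant."""
    e2 = e_mod * cmath.exp(1j * (k * x + phi))
    return abs(e0 + e2) ** 2

def expanded_intensity(e0, e_mod, k, x, phi):
    """Right-hand side of Equation 9.16 with the constant set to 1."""
    return e0 ** 2 + e_mod ** 2 + 2.0 * e0 * e_mod * math.cos(k * x + phi)

for x in (0.0, 0.1, 1.7):
    print(x, hologram_intensity(2.0, 0.5, 3.0, x, 0.4),
          expanded_intensity(2.0, 0.5, 3.0, x, 0.4))
```

The cosine term is the only one that carries the phase φ(x), which is why the reference beam is needed.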

Equation 9.16 represents the intensity variation with x in the 
pattern of light that is photographed in making the hologram. This 
photograph is in the form of a positive transparency. As this posi- 
tive transparency (hologram) is to be subsequently viewed by trans- 
mitted monochromatic light, its transmittance T(x) as a function of 
$x$ is of interest. The intensity of the light incident on the
photographic emulsion is $I(x)$. From the data in Section 5.30 on the $\gamma$ of
a photographic emulsion, assuming the exposure is correct so that
the straight-line portion of the characteristic curve (Fig. 5.23) is
utilised,

$$\log \frac{1}{T} = D = \gamma \log(\text{exposure}) \propto \gamma \log |I(x)|$$

Therefore

$$\frac{1}{T(x)} \propto [I(x)]^{\gamma}$$

or

$$T(x) \propto [I(x)]^{-\gamma}$$

The amplitude distribution in the hologram transmittance is therefore
given by

$$\sqrt{T(x)} \propto [I(x)]^{-\gamma/2}$$

Substituting for $I(x)$ from Equation 9.16,

$$\sqrt{T(x)} = a\{|E_0|^2 + |E(x)|^2 + 2|E_0|\,|E(x)| \cos[kx + \phi(x)]\}^{-\gamma/2}$$

$$= a|E_0|^{-\gamma}\left[1 - \frac{\gamma}{2}\frac{|E(x)|^2}{|E_0|^2} - \gamma\frac{|E(x)|}{|E_0|} \cos[kx + \phi(x)]\right] \text{ approx.}$$

where $a$ is a constant.

As $E_0$ is constant for a given reference beam and $\gamma$ is a constant
for a given photographic emulsion and development,

$$\sqrt{T(x)} = b\{2|E_0|^2 - \gamma|E(x)|^2 - 2\gamma|E_0|\,|E(x)| \cos[kx + \phi(x)]\} \text{ approx.}$$

where $b$ is a constant; or

$$\sqrt{T(x)} = b\left(2|E_0|^2 - \gamma|E(x)|^2 - \gamma|E_0|\,|E(x)| \exp\{j[kx + \phi(x)]\} - \gamma|E_0|\,|E(x)| \exp\{-j[kx + \phi(x)]\}\right) \tag{9.17}$$

Equation 9.17 is a mathematical representation (in one dimension, 
as variations of x only are discussed) of the hologram produced. 
It contains the phase as well as the amplitude information. The term 
'hologram' was introduced in 1951 by Gabor, to whom the original 
process is credited. 
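
The step from the exact power law to the linearised form leading to Equation 9.17 relies on the object beam being weak compared with the reference beam. The sketch below compares the two expressions with the constant a set to unity; the amplitudes, angle and γ are arbitrary assumed values.

```python
import math

def exact_amplitude(e0, e, theta, gamma):
    """[I(x)]**(-gamma/2), with theta standing for kx + phi(x) and a = 1."""
    intensity = e0 ** 2 + e ** 2 + 2.0 * e0 * e * math.cos(theta)
    return intensity ** (-gamma / 2.0)

def approx_amplitude(e0, e, theta, gamma):
    """Linearised form used to reach Equation 9.17, valid for e << e0."""
    r = e / e0
    return e0 ** (-gamma) * (1.0 - 0.5 * gamma * r ** 2 - gamma * r * math.cos(theta))

for ratio in (0.01, 0.1, 0.5):
    exact = exact_amplitude(1.0, ratio, 1.0, 1.0)
    approx = approx_amplitude(1.0, ratio, 1.0, 1.0)
    print(ratio, abs(exact - approx) / exact)  # relative error grows with the ratio
```

The error is of second order in |E(x)|/|E₀|, which suggests keeping the reference beam considerably stronger than the object beam.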



9.16 The Reconstruction Process in Holography 

If a beam of monochromatic light of constant amplitude with a 
plane wavefront from a laser is incident upon a hologram represented
by Equation 9.17, the transmitted beam has its amplitude modulated
because the incident amplitude is multiplied by $\sqrt{T(x)}$ on
transmission.

Fig. 9.18. The reconstruction process in holography (diagram labels: upward beam
$\gamma|E_0| \exp(jkx)\,|E(x)| \exp[j\phi(x)]$ GIVES VIRTUAL IMAGE; UNDEVIATED BUT ATTENUATED
BEAM; downward beam $\gamma|E_0| \exp(-jkx)\,|E(x)| \exp[-j\phi(x)]$ gives THREE-DIMENSIONAL
RECONSTRUCTION OF ORIGINAL OBJECT)

The first two terms $2|E_0|^2$ and $-\gamma|E(x)|^2$ in Equation 9.17 give
rise to an undeviated but slightly diffracted beam with uniform
attenuation as part of the transmitted light. The Fourier transform
of a constant is a Dirac delta function (see Appendix) corresponding
to the image of this beam at infinity. The third term
$-\gamma|E_0|\,|E(x)| \exp\{j[kx + \phi(x)]\}$ in Equation 9.17 gives rise to a part of the
transmitted light in the form of a beam deviated upwards (Fig.
9.18, where the upward beam is $\exp(jkx)$), and is $\gamma|E_0| \exp(jkx)$
multiplied by $|E(x)| \exp[j\phi(x)]$. This upward deviated beam is
therefore modulated by the original beam from the object and
forms a virtual image of the object in this direction.

The final term $-\gamma|E_0|\,|E(x)| \exp\{-j[kx + \phi(x)]\}$ in Equation
9.17 gives a part of the transmitted light in the form of a beam
deviated downwards, in which $\gamma|E_0| \exp(-jkx)$ is modulated by
$E(x) \exp[-j\phi(x)]$. This is the complex conjugate of the wavefront
arising at the hologram from the original object.

Since the modulation is by the complex conjugate, this wave in
effect retraces the steps back to the original object as it progresses.
This can be seen from the example represented by Fig. 9.19. As the
spherical wavefront from the point source S moves from A to B,
its phase angle decreases, which is equivalent to multiplication by
$\exp(-jkr)$. If, however, multiplication is by $\exp(jkr)$ (the complex
conjugate), this moves the wavefront backwards in time from B
[with an arbitrary phase $\exp(j\omega t)$] to A.

Thus a real three-dimensional image of the original object is 
formed (Fig. 9.18) from this wavefront represented by the final 
term of Equation 9.17. 



Fig. 9.19. A spherical wavefront moving from A to B is equivalent to multiplication
by $\exp(-jkr)$; movement backwards from B to A is equivalent to multiplication
by $\exp(jkr)$

9.17 The Magnification Due to the Holographic Process 

Consider two points in the object for which a hologram is made. 
If this hologram is subsequently illuminated by light of the same 
wavelength as that used in making the hologram, the separation of 
these points in the image formed by the hologram is the same as in 
the original object. The linear magnification is unity. 

If, however, the hologram is illuminated by light of different 
wavelength from that used to make the hologram, magnification is 
achieved. 

An expression for this magnification can be obtained in a
particularly simple way by considering the hologram formed by a
Young's double slit arrangement (Fig. 9.20).

Fig. 9.20. A hologram with Young's double-slit arrangement (diagram labels:
LASER BEAM OF LIGHT, SLIT 1, SLIT 2, HOLOGRAM)

The incident laser light may be considered to be divided into two beams
by the slits, where slit 1 forms the reference beam and slit 2 the
object. If the reference beam from slit 1 is represented by

$$E_1 = E_0 \exp(jkx)$$

then the object beam from slit 2 is represented by

$$E_2 = E_0 \exp(-jkx)$$

The intensity in the plane of the hologram at OA is proportional
to $|E_1 + E_2|^2 = |E|^2$, say, where

$$|E|^2 = |E_0[\exp(jkx) + \exp(-jkx)]|^2 = 4|E_0|^2 \cos^2 kx \tag{9.18}$$

The fringe separation is given by the usual Young's double slit
expression (Equation 8.1 of Volume 1)

$$\Delta x = \frac{D\lambda}{l} \tag{9.19}$$

where $\Delta x$ is the separation between neighbouring bright fringes,
$D$ is the axial distance from the plane of the slits to the plane of the
diffraction pattern OA, $l$ is the separation between the slits, and
$\lambda$ is the wavelength of the light.

Recording by photography to give a positive transparency of this 
diffraction pattern gives the hologram of the slit 2 in relation to 
slit 1. 

Suppose this hologram is illuminated with light of wavelength
$\lambda'$. The usual Fourier transform image corresponding to the last
term of Equation 9.17 is obtained. At a distance $D'$, the separation
between the images of the slits will be $l'$ such that

$$\Delta x = \frac{D'\lambda'}{l'} \tag{9.20}$$

The linear magnification $m$ due to the process is seen from
Equations 9.19 and 9.20 to be

$$m = \frac{l'}{l} = \frac{D'\lambda'}{D\lambda}$$

One application of this magnification procedure is due to Gabor,
the originator of holography. The hologram is formed by
illuminating the object (in an electron microscope) with electrons of
wavelength given by $h/\sqrt{2 m_e e V}$, where $h$ is Planck's constant, $m_e$ and
$e$ are the mass and charge of the electron, and $V$ is the p.d. through
which the electrons are accelerated. With $V$ = 10,000 V, $\lambda$ =
$12.26/\sqrt{V}$ = 0.1226 Å. If this hologram is viewed by light of
wavelength 5,000 Å, say, an enormous magnification of about
$4 \times 10^4$ is possible.

The possibility exists also of obtaining such enormous
magnifications in X-ray microscopy where the X-rays have a wavelength
of 0.1 Å, say, and the light used to illuminate the hologram is at
a wavelength of 5,000 Å. At present, however, X-ray beams with
the necessary coherence are not obtainable.
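
The figures in this section can be evaluated directly. The sketch below computes the non-relativistic electron wavelength h/√(2mₑeV) from the constants tabulated at the end of this book, and the magnification as the ratio D'λ'/(Dλ); the function names are illustrative, and taking D' = D is an assumption.

```python
import math

H = 6.62517e-34         # Planck's constant (J s)
M_E = 9.1083e-31        # electron rest mass (kg)
E_CHARGE = 1.60206e-19  # electron charge (C)

def electron_wavelength_angstrom(volts):
    """Non-relativistic de Broglie wavelength h / sqrt(2 m e V), in angstroms."""
    wavelength_m = H / math.sqrt(2.0 * M_E * E_CHARGE * volts)
    return wavelength_m * 1e10

def holographic_magnification(lam_record, lam_view, d_record=1.0, d_view=1.0):
    """m = D' lambda' / (D lambda); wavelengths in the same units, D' = D by default."""
    return (d_view * lam_view) / (d_record * lam_record)

lam_e = electron_wavelength_angstrom(10_000.0)
print(f"electron wavelength at 10 kV: {lam_e:.4f} angstrom")
print(f"magnification with 5000 angstrom light: {holographic_magnification(lam_e, 5000.0):.3g}")
print(f"X-ray case (0.1 angstrom): {holographic_magnification(0.1, 5000.0):.3g}")
```

At higher accelerating voltages a relativistic correction to the wavelength becomes significant; it is omitted here.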



Exercise 9 

1. Write an essay on the coherence of radiation. 

2. With the aid of a three-level energy diagram in which an appropriate level is 
metastable, explain how population inversion can be obtained leading to 
stimulated emission of radiation. 

3. With the aid of suitable diagrams, describe in outline the construction and 
operation of a pulsed ruby optical maser. 

4. Explain, in relation to an appropriate energy-level diagram, how stimulated 
emission results from electron impact pumping in a helium-neon gas laser. 




5. Write an essay outlining some of the present and possible future applications 
of optical masers. 

6. Without giving mathematical detail but with the aid of simple ray diagrams, 
describe the principles involved in obtaining a hologram of a transparent 
object. 

7. Describe the holographic process in simple mathematical terms by reference 
to the formation of a hologram from either a Fresnel double-mirror system 
or a Young's double-slit arrangement. Explain briefly how large linear 
magnifications are possible in holography, indicating the limitations in- 
volved in the practical use of two sources of light of greatly differing wave- 
lengths, where one source is used to record the hologram and the other in the 
reconstruction of the image. 

8. An aperture is filled with a plane wavefront of monochromatic laser light
of amplitude $E_0$ combined with light whose complex amplitude is represented
by $E_2 = E(x) \exp[j\phi(x)]$; this latter light comes from the same laser as the
monochromatic light, but is scattered by an object which is considered, for
simplicity, to modulate this light in an x-direction only. The pattern of
light formed in the aperture is photographed to produce a hologram utilising
a photographic emulsion of which the slope of the characteristic curve is $\gamma$.
Derive an equation which represents the variation with x of the amplitude of 
the light transmitted by this hologram when it is illuminated with mono- 
chromatic light. Explain how a component of this transmitted light gives 
rise to a real image of the original object. 



Appendix 



A.1 Fourier's Theorem

Let $f(x)$ be a periodic function in $x$ with a period $a$ and frequency
$\nu = 1/a$. According to Fourier, such a function can be represented
by a series of sine and cosine terms in the form

$$f(x) = A_0 + A_1 \sin 2\pi\nu x + A_2 \sin 4\pi\nu x + \ldots + A_p \sin 2\pi p\nu x + \ldots + B_1 \cos 2\pi\nu x + B_2 \cos 4\pi\nu x + \ldots + B_p \cos 2\pi p\nu x + \ldots$$

or

$$f(x) = A_0 + \sum_{p=1}^{N} (A_p \sin 2\pi p\nu x + B_p \cos 2\pi p\nu x) \tag{A.1}$$

where $A_0$, $A_p$ and $B_p$ for $p = 1, 2, 3, \ldots, N$ are coefficients to be
determined.

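The values of the coefficients $A_0$, $A_p$ and $B_p$ may be found by
using the orthogonality relationships of sines and cosines, which are:

$$\int_0^{2\pi} \cos p\theta \cos q\theta \, d\theta = \begin{cases} \pi & \text{for } p = q \\ 0 & \text{for } p \neq q \end{cases} \tag{A.2}$$

$$\int_0^{2\pi} \sin p\theta \sin q\theta \, d\theta = \begin{cases} \pi & \text{for } p = q \\ 0 & \text{for } p \neq q \end{cases} \tag{A.3}$$

$$\int_0^{2\pi} \cos p\theta \sin q\theta \, d\theta = 0 \tag{A.4}$$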

Thus, if both sides of Equation A.1 are multiplied by $\cos 2\pi q\nu x$ and
integration is carried out over a period, then for $q \le N$

$$\int_0^a f(x) \cos 2\pi q\nu x \, dx = A_0 \int_0^a \cos 2\pi q\nu x \, dx + \sum_{p=1}^{N} \int_0^a (A_p \sin 2\pi p\nu x \cos 2\pi q\nu x + B_p \cos 2\pi p\nu x \cos 2\pi q\nu x) \, dx$$

In Equations A.2, A.3 and A.4, putting $\theta = 2\pi\nu x$, the period $a$ is
equivalent to angle $2\pi$. Therefore, in view of Equations A.2 and
A.4, when $p = q$ all terms within $\sum_{p=1}^{N}$ disappear except for the
term $B_p \cos 2\pi p\nu x \cos 2\pi q\nu x$. Thus

$$\int_0^a f(x) \cos 2\pi q\nu x \, dx = A_0 \int_0^a \cos 2\pi q\nu x \, dx + \int_0^a B_p \cos^2 2\pi p\nu x \, dx$$

Now,

$$\int_0^a \cos 2\pi q\nu x \, dx = \frac{\sin 2\pi q\nu a}{2\pi q\nu} = 0$$

and, from Equation A.2,

$$\int_0^a \cos^2 2\pi p\nu x \, dx = \frac{a}{2}$$

Therefore

$$\int_0^a f(x) \cos 2\pi q\nu x \, dx = \frac{a B_p}{2}$$

so

$$B_p = \frac{2}{a} \int_0^a f(x) \cos 2\pi q\nu x \, dx \tag{A.5}$$



Similarly, multiplying both sides of Equation A.1 by $\sin 2\pi q\nu x$, it
follows that

$$A_p = \frac{2}{a} \int_0^a f(x) \sin 2\pi q\nu x \, dx \tag{A.6}$$



To find $A_0$, integrate Equation A.1 from 0 to $a$ to give

$$\int_0^a f(x) \, dx = \int_0^a A_0 \, dx = A_0 a$$

because all other terms in Equation A.1 become zero in view of the
relationship

$$\int_0^{2\pi} \sin p\theta \, d\theta = \int_0^{2\pi} \cos p\theta \, d\theta = 0$$

Therefore

$$A_0 = \frac{1}{a} \int_0^a f(x) \, dx \tag{A.7}$$

Equations A.5, A.6 and A.7 are used to evaluate the constants $B_p$,
$A_p$ and $A_0$ respectively.
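
Equations A.5 to A.7 can be evaluated numerically for any periodic function. The sketch below uses the midpoint rule and a square wave of unit period as a test case; the function names and the choice of test function are assumptions made for illustration. For this square wave the sine coefficients should come out close to 4/(πp) for odd p and zero for even p.

```python
import math

def fourier_coefficients(f, a, n_terms, samples=2000):
    """Return (A0, A, B): A[p-1] and B[p-1] multiply sin and cos of 2*pi*p*x/a."""
    dx = a / samples
    xs = [(i + 0.5) * dx for i in range(samples)]   # midpoint rule over one period
    a0 = sum(f(x) for x in xs) * dx / a             # Equation A.7
    sin_coeffs, cos_coeffs = [], []
    for p in range(1, n_terms + 1):
        w = 2.0 * math.pi * p / a
        sin_coeffs.append(2.0 / a * sum(f(x) * math.sin(w * x) for x in xs) * dx)  # A.6
        cos_coeffs.append(2.0 / a * sum(f(x) * math.cos(w * x) for x in xs) * dx)  # A.5
    return a0, sin_coeffs, cos_coeffs

def square_wave(x):
    """Unit period: +1 on the first half of each period, -1 on the second."""
    return 1.0 if (x % 1.0) < 0.5 else -1.0

a0, A, B = fourier_coefficients(square_wave, 1.0, 5)
for p, (ap, bp) in enumerate(zip(A, B), start=1):
    print(p, round(ap, 5), round(bp, 5))
```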

A.2 The Complex Fourier Series 

As

$$\cos 2\pi p\nu x = \frac{1}{2}[\exp(j2\pi p\nu x) + \exp(-j2\pi p\nu x)]$$

and

$$\sin 2\pi p\nu x = -\frac{j}{2}[\exp(j2\pi p\nu x) - \exp(-j2\pi p\nu x)]$$

a pair of terms in Equation A.1 such as $A_p \sin 2\pi p\nu x + B_p \cos 2\pi p\nu x$
may be written as

$$\frac{A_p}{2j}[\exp(j2\pi p\nu x) - \exp(-j2\pi p\nu x)] + \frac{B_p}{2}[\exp(j2\pi p\nu x) + \exp(-j2\pi p\nu x)] = C_p \exp(j2\pi p\nu x) + C_{-p} \exp(-j2\pi p\nu x)$$

where

$$C_p = \frac{1}{2}\left(\frac{A_p}{j} + B_p\right) = \frac{1}{2}(B_p - jA_p) \tag{A.8}$$

and

$$C_{-p} = \frac{1}{2}\left(-\frac{A_p}{j} + B_p\right) = \frac{1}{2}(jA_p + B_p) \tag{A.9}$$

Equation A.1 can therefore be written in the form

$$f(x) = \sum_{p=-N}^{N} C_p \exp(j2\pi p\nu x) \tag{A.10}$$

where $p = -N, -(N-1), \ldots, -1, 0, 1, \ldots, N$, and where it is seen
from Equations A.8 and A.9 that the values of $C_p$ are, in general,
complex.

To find the coefficients in Equation A.10, the orthogonality
relationship used is

$$\int_0^{2\pi} \exp(jp\theta) \exp(jq\theta) \, d\theta = \begin{cases} 2\pi & \text{for } p = -q \\ 0 & \text{for } p \neq -q \end{cases}$$

Multiplying both sides of Equation A.10 by $\exp(-j2\pi p\nu x)$ and
integrating over the range 0 to $a$ gives

$$C_p = \frac{1}{a} \int_0^a f(x) \exp(-j2\pi p\nu x) \, dx \tag{A.11}$$



The coefficients in Equation A.10 may be represented by a Fourier
spectrum (Fig. A.1) making use of Equation A.11.

Fig. A.1. A Fourier spectrum ($p$ is positive or negative) (two panels: REAL PART
OF $C_p$ and IMAGINARY PART OF $C_p$, each plotted against $p\nu$)

If the function $f(x)$ is real, then

$$C_p = C^*_{-p}$$

where $C^*_{-p}$ is the conjugate of $C_{-p}$, giving a series of real terms
$C_p \exp(j2\pi\nu px) + C^*_p \exp(-j2\pi\nu px)$.



A.3 The Fourier Transform 

Suppose that the range of the function to be represented by a
Fourier series is very large. The period $a$ in Equation A.11 then tends to
infinity, and $\nu$ (the interval between the coefficients $C_p$ in the Fourier
spectrum illustrated in Fig. A.1) tends to zero. The amplitudes of
the various terms are now represented by a continuous function,
in the limit (Fig. A.2). The coefficient $C_p$ represents the contribution
to the amplitude between frequencies $p\nu$ and $p\nu + d\nu$ and is expressed
by

$$C_p = F(\nu) \, d\nu$$

Equation A.11 is therefore written in the form

$$F(\nu) \, d\nu = \frac{1}{a} \int_0^a f(x) \exp(-j2\pi p\nu x) \, dx$$

Let $d\nu \to 0$ and $a \to \infty$, representing a range from $-\infty$ to $+\infty$.
Also, write $p\nu$ as a general frequency $\nu$. Then,

$$F(\nu) = \lim_{d\nu \to 0} \frac{1}{a \, d\nu} \int f(x) \exp(-j2\pi p\nu x) \, dx$$

or

$$F(\nu) = \int_{-\infty}^{\infty} f(x) \exp(-j2\pi\nu x) \, dx \tag{A.12}$$

Note that both $f(x)$ and $F(\nu)$ are generally complex.

Equation A.12 is true if the functions and their first and second
derivatives are continuous. The function $F(\nu)$ is called the Fourier
transform of $f(x)$. The Fourier transform of any function is therefore
simply the amplitude distribution of the continuous Fourier
series which represents that function. The converse is also true, i.e.

$$f(x) = \int_{-\infty}^{\infty} F(\nu) \exp(j2\pi\nu x) \, d\nu \tag{A.13}$$

where the exponential sign is strictly reversed but, in practice, this is
arbitrary.

Fig. A.2. Diagram of the Fourier spectrum where the interval between
coefficients tends to zero

The functions $f(x)$ and $F(\nu)$ are known as Fourier mates, and
$f(x) \rightleftharpoons F(\nu)$.




A.4 Examples of Fourier Transforms 

A common example is the Gaussian function [Fig. A.3(a)]. It is
given by

$$f(x) = h \exp\left(-\frac{x^2}{\sigma^2}\right) \tag{A.14}$$

Fig. A.3. The Gaussian function and its Fourier transform

Therefore, applying Equation A.12,

$$F(\nu) = \int_{-\infty}^{\infty} h \exp\left(-\frac{x^2}{\sigma^2}\right) \exp(-j2\pi\nu x) \, dx = h \exp(-\pi^2\nu^2\sigma^2) \int_{-\infty}^{\infty} \exp\left[-\left(\frac{x}{\sigma} + j\pi\nu\sigma\right)^2\right] dx$$

As

$$\int_{-\infty}^{\infty} \exp(-z^2) \, dz = \sqrt{\pi},$$

therefore

$$F(\nu) = h\sigma\sqrt{\pi} \exp(-\pi^2\nu^2\sigma^2) \tag{A.15}$$

Equation A.15 is also a Gaussian function [Fig. A.3(b)], but note
that the width parameter is proportional to $1/\sigma$, which is generally
true of Fourier transforms.
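
Equation A.15 can be checked by carrying out the integral of Equation A.12 numerically. The sketch below truncates the infinite range at a finite half-range (an assumption that is safe once the Gaussian has decayed to a negligible value) and compares the result with the closed form; the parameter values are arbitrary.

```python
import cmath
import math

def gaussian(x, h, sigma):
    """Equation A.14."""
    return h * math.exp(-(x / sigma) ** 2)

def transform(f, nu, half_range, samples=4000):
    """Midpoint-rule estimate of Equation A.12 over [-half_range, +half_range]."""
    dx = 2.0 * half_range / samples
    total = 0j
    for i in range(samples):
        x = -half_range + (i + 0.5) * dx
        total += f(x) * cmath.exp(-2j * math.pi * nu * x) * dx
    return total

def analytic_transform(nu, h, sigma):
    """Equation A.15."""
    return h * sigma * math.sqrt(math.pi) * math.exp(-(math.pi * nu * sigma) ** 2)

numeric = transform(lambda x: gaussian(x, 2.0, 0.5), 0.3, 5.0)
print(numeric, analytic_transform(0.3, 2.0, 0.5))
```

Halving σ in this sketch doubles the width of the transform, which is the inverse relationship noted in the text.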



Fig. A.4. The 'top-hat' function and its Fourier transform

Another example is the 'top-hat' function [Fig. A.4(a)], where
$f(x) = h$ for $-a/2 \le x \le a/2$ and is zero elsewhere. Therefore, from
Equation A.12,

$$F(\nu) = \int_{-a/2}^{+a/2} h \exp(-j2\pi\nu x) \, dx = ha\,\frac{\sin\phi}{\phi}$$

where $\phi = \pi a\nu$.

The function $F(\nu)$ has a first minimum at $\phi = \pi = \pi a\nu$; hence
$\nu = 1/a$ [Fig. A.4(b)], so again the inverse width relationship has
been obtained.
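
The top-hat transform and the position of its first zero can be confirmed numerically. The sketch below evaluates the closed form ha sin φ/φ alongside a midpoint-rule estimate of the defining integral; the height and width used are arbitrary assumed values.

```python
import cmath
import math

def top_hat_transform(nu, h, a):
    """Closed form h*a*sin(phi)/phi with phi = pi*a*nu."""
    phi = math.pi * a * nu
    return h * a if phi == 0.0 else h * a * math.sin(phi) / phi

def top_hat_transform_numeric(nu, h, a, samples=2000):
    """Midpoint-rule estimate of Equation A.12 over the width of the top-hat."""
    dx = a / samples
    total = 0j
    for i in range(samples):
        x = -a / 2.0 + (i + 0.5) * dx
        total += h * cmath.exp(-2j * math.pi * nu * x) * dx
    return total

h, a = 2.0, 1.5
print(top_hat_transform(0.3, h, a), top_hat_transform_numeric(0.3, h, a))
print(top_hat_transform(1.0 / a, h, a))  # value at nu = 1/a, the first zero
```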



A.5 The Dirac Delta-function

The area under the Gaussian curve [Fig. A.3(a)] represented by
Equation A.14 is given by

$$A = \int_{-\infty}^{\infty} h \exp\left(-\frac{x^2}{\sigma^2}\right) dx = h\sigma\sqrt{\pi}$$

Hence if $h = 1/(\sigma\sqrt{\pi})$, $A = 1$. The Dirac delta-function $\delta(x)$ may be
defined as the limiting Gaussian function when $h \to \infty$ and $\sigma \to 0$,
subject to the normalising condition that $A = 1$. This function is
zero except when $x = 0$. It has special properties, a particularly
useful one being that

$$\int_{-\infty}^{\infty} \delta(x) f(x) \, dx = f(0) \tag{A.16}$$

and

$$\int_{-\infty}^{\infty} \delta(x - a) f(x) \, dx = f(a) \tag{A.17}$$

From Equation A.15, the Fourier transform of $\delta(x)$ is

$$\lim_{\sigma \to 0} \left(\frac{1}{\sigma\sqrt{\pi}}\right) \sigma\sqrt{\pi} \exp(-\pi^2\nu^2\sigma^2) = 1 \tag{A.18}$$

In general terms, therefore, the Fourier transform of the
delta-function is a constant, and vice versa. So Dirac delta-function $\rightleftharpoons$
constant.
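
The sifting property of Equation A.17 can be illustrated by replacing the delta-function with a narrow normalised Gaussian, as in its definition above. In the sketch below the width σ and the test function are assumed values; the smaller σ is made, the closer the integral comes to f(a).

```python
import math

def narrow_gaussian(x, sigma):
    """Gaussian of unit area: h = 1/(sigma*sqrt(pi)) in Equation A.14."""
    return math.exp(-(x / sigma) ** 2) / (sigma * math.sqrt(math.pi))

def sift(f, centre, sigma, samples=2000):
    """Midpoint-rule estimate of the integral of gaussian(x - centre) * f(x)."""
    half_width = 10.0 * sigma          # the Gaussian is negligible beyond this
    dx = 2.0 * half_width / samples
    total = 0.0
    for i in range(samples):
        x = centre - half_width + (i + 0.5) * dx
        total += narrow_gaussian(x - centre, sigma) * f(x) * dx
    return total

for sigma in (0.1, 0.01, 0.001):
    print(sigma, sift(math.cos, 0.7, sigma), math.cos(0.7))
```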



Numerical Values of Constants 



Atomic mass unit (a.m.u.) = $1.6598 \times 10^{-27}$ kg
Avogadro's number ($N$) = $6.02486 \times 10^{23}$ molecules mol$^{-1}$
Bohr magneton ($\beta$) = $1.16544 \times 10^{-29}$ Wb m
Boltzmann's constant ($k$) = $1.38044 \times 10^{-23}$ J deg$^{-1}$ C
Electron, charge of ($e$) = $1.60206 \times 10^{-19}$ C
Electron, rest mass of ($m_e$) = $9.1083 \times 10^{-31}$ kg
Electron, specific charge of ($e/m_e$) = $1.7589 \times 10^{11}$ C kg$^{-1}$
Electron volt (eV) = $1.60206 \times 10^{-19}$ J
Hydrogen atom, mass of = $1.6733 \times 10^{-27}$ kg
                       = 1.008142 a.m.u.
Permeability of free space ($\mu_0$) = $4\pi \times 10^{-7}$ H m$^{-1}$
Permittivity of free space ($\varepsilon_0$) = $8.85424 \times 10^{-12}$ F m$^{-1}$
Planck's constant ($h$) = $6.62517 \times 10^{-34}$ J s
Proton, rest mass of ($m_p$) = $1.67239 \times 10^{-27}$ kg
                             = 1.007593 a.m.u.
                             = 1,836.12 $m_e$
Rydberg constant for infinite mass ($R_\infty$) = $1.0973731 \times 10^{7}$ m$^{-1}$
Velocity of light in free space ($c$) = $2.99793 \times 10^{8}$ m s$^{-1}$



Answers to Numerical Exercises 



Exercise 1, Page 32 

7. Reflected intensity 0.1722 times incident intensity
8. 1/9
11. 109.5 V m$^{-1}$



Exercise 2, Page 90 

2. (a) $3 \times 10^{-7}$ m = 3,000 Å; (b) $8.486 \times 10^{5}$ m s$^{-1}$

3. (a) $6.546 \times 10^{-7}$ m = 6,546 Å; $4.854 \times 10^{-7}$ m = 4,854 Å;
$4.339 \times 10^{-7}$ m = 4,339 Å; (b) 13.67 eV

4. $5.3 \times 10^{-11}$ m = 0.53 Å

5. 13.67 eV; $5.3 \times 10^{-11}$ m = 0.53 Å

7. $1.8 \times 10^{11}$ C kg$^{-1}$

9. 2.15 Å in wavelength on either side of the line of wavelength 5000.0 Å



Exercise 3, Page 147 

3. 1.732

5. $1.64 \times 10^{-3}$ cm

6. Light is plane-polarised

9. $1/\sqrt{3}$; 30°

10. (a) plane-polarised; (b) elliptically polarised

11. 7.2 mm; 2.4 mm

13. 4180 Å, 4500 Å, 4910 Å, 5460 Å, 6090 Å and 6920 Å

15. 130

18. 1.3°; $3.6 \times 10^{-8}$

19. For left-handed rotation: rotations of 36°, 216°, 396° and 576° correspond
to one-third previous concentration. Estimates of rotations for full
concentration are therefore 108°, 648°, 1188° and 1728°; these are the same as 108°,
288°, 108°, 288°, of which 288° ≡ 72° right-handed rotation. Therefore
a left-handed rotation of 36° is, in fact, 216°, corresponding to 648°
≡ 288° ≡ 72° right-handed. To avoid ambiguities, alter tube length by a small
amount.

Exercise 4, Page 177 

5. 10 cm 

7. 25.46 days

8. $1.25 \times 10^{-10}$ cm

9. $202 \times 10^{-5}$ cm or $404 \times 10^{-5}$ cm



Exercise 5, Page 246 



5. Mean wavelength = 5788 Å; range of wavelengths = 68 Å
11. 6235 Å and 6061 Å
16. 50 cm length in first order; 25 cm length in second order



Exercise 6, Page 295 

1. 17.32 cm or $17.32\sqrt{n}$ cm where $n$ = 3 or 5

2. Bright image at distance beyond zone plate of 31.6 cm, with fainter images
at 19.4 cm and 14 cm; much fainter images in between these positions
4. 543 cm; 45.5 cm; 11.22 cm
7. $\theta \approx 65°$



Exercise 7, Page 311 

4. 0.2, 0.3 and 0.5



Exercise 8, Page 323 

1. $3.57 \times 10^{-5}$ cm

6. (a) $3.23 \times 10^{-5}$ cm; (b) $2.26 \times 10^{-5}$ cm; (c) $3.35 \times 10^{-5}$ cm



Index 



Abbe equation for limit of resolution 
of microscope 316 

Abbe theory of microscope 312, 314 

Absorbance 228 

Absorptiometer, prism, Unicam 235 

Absorption coefficient 227 

Absorption of light, electromagnetic 
theory of 26-27 

Absorption spectra 27, 29, 80, 188, 
232,238 

Absorption spectrophotometers 227,
235, 238-241

Absorption wavelengths, characteris- 
tic 241 

a.c. arcs 186 

Alkali metals 50, 55-56, 181 

Ampere's circuital law 2, 4 

Amplitude 
and frequency incoherence 325 
division of 284-285 

Amplitude object 316 

Analysis of materials 179 

Angular dispersion 219-220 
of diffraction grating 197-198 
of prism 197 

Angular momentum 48, 52, 71, 72 

Angular velocity of precession of el- 
liptical orbit 72-73 

Anharmonicity constants 81 

Annular aperture 320 

Anomalous dispersion 30 

Anti-Stokes lines 85 

Aperiodic method 246 

Aperture function 291-293 

Arc sources 184-186 

Argon 181-183 



Astronomical applications of spectroscopy 241-242
Atmosphere, carbon dioxide content 239
Atomic beam sources 184
Atomic spectra 37
Atoms
  electron states of 49
  energy radiation 39
  energy required to excite 42
  in gas or vapour 180
  in solid state 179
  nuclear model of 38
  spectra of, due to transitions of two or three electrons 57-59
Automatic recording 236



Babinet compensator 121-125, 144 
Babinet principle of complementary 

screens 275 
Bacterial action 239 
Balmer equation 37, 38, 41 
Balmer series 43 
Band spectra 79, 180 
Barrell 169 
Bartholinus 99 
Beam-splitter 245 
Beer's law 227, 228 
Benoit 169, 170

Benoit method of exact fractions 162 
Beryllium 47, 57, 222 
Bessel function 275 
Biot's law 133 
Biquartz plate 135 
Bird 97 






Birefringent crystal 99, 104, 105, 106, 

109, 126 
Black-body radiator 35, 189 
Blazed gratings 208 
Bohr, Niels 35, 38, 60 
Bohr magneton 52, 76 
Bohr theory of spectrum of hydrogen 

atom 37-43, 45-46 
Bolometer 191, 195 
Boltzmann's constant 85, 180, 331 
Born 60 
Boron 58, 222 

Boundary conditions 12, 17, 18 
Brackett series 43 
Brewster angle 22, 23, 93, 95, 146, 

342 
Brewster fringes 166-169 
Brewster windows 342 
Brightness 297, 298 
Bunsen and Roscoe's reciprocity law 

231 



Cadmium 73, 169, 170, 183, 222 
Calcite crystals 99, 102, 104, 141 
Calibration of spectroscope 222- 

225 
Camera, lensless 350, 351 
Canada balsam 102 
Carbon arc 184-186 
Cauchy's equation 30, 31 
Characteristic absorption wave- 
lengths 241 
Characteristic curve for photograph- 
ic emulsion 230 
Chemical reactions, progress of 241 
Chemical structure 238 
Chromatic aberration 322 
Chromatic resolving power 215 
of diffraction grating 198 
of prism 197 
of spectrometer 225 
Chromaticity 298, 305 
Chromaticity diagram 304-306 
C.I.E. See International Commission

of Illumination 
Circular aperture 
Fraunhofer diffraction due to 274- 

275 
Fresnel's treatment of plane wave- 
front in 250-254 



spherical wavefront in 255-257 
Kirchhoff's theory of diffraction
of 279-281 
Coefficient of finesse 159 
Coherence 282-284, 325, 349 
Coherence length 329 
Coherence time 329 
Colorimeter and colorimetry 300, 

309-311 
Colour 297-311 

additive mixing 298-300 

blindness 299 

desaturation 311 

introductory concepts 297 

measurement 299, 309-311 
C.I.E. system 302-306

minus-greens 306 

non-spectral 306 

of surfaces illuminated by source of 
light 306 

primary 300, 302, 307 

printing processes 308 

purity 298, 306 

receptors 298 

reproduction and photographic pro- 
cesses 308 

specification 299, 306 

subtractive mixing 307-309 

temperature 231 

transparencies 308 

tristimulus additive matching 300 

vision theories 298-300 
Coma 323 

Combustion processes 239 
Communications, laser applications 

349 
Compensator 137 

in saccharimeter 137 
Complementary screens, Babinet prin- 
ciple of 275 
Complex Fourier series 361 
Concave grating. See Gratings 
Concentration of solution 238 
Constant inclination fringes 166 
Cornu-Jellet prism 135 
Cornu prism 202 
Cornu spiral 263-266, 268 
Critical illumination 313 
Cylindrical wavefront 260 

diffraction of 265-268 






Dark-ground illumination 316, 320 

Data processing, laser 350 

Davisson 60 

d.c. carbon arc 185 

de Broglie, Louis 59 

de Broglie hypothesis 61 

de Broglie wavelength 60 

de Broglie waves 61 

Degeneracy 85 

Deuterium 47, 183 

Dextrorotatory substances 127, 132, 

137 
Dielectrics, electromagnetic theory of 

27 
Diffraction 

far-field 268 

Fraunhofer 268, 282 
due to circular aperture 274-275 
due to narrow slit 269, 291-295 
due to rectangular aperture 271 
treated by Fourier methods 289- 
291 

Fresnel 250, 269 

Kirchhoff's theory of 276, 279-281 

of cylindrical wavefront 265-268 

of electromagnetic waves 249 
Diffraction grating. See Gratings 
Diffraction pattern 244, 350

Fraunhofer 350-351 
Dipole 24, 25, 34, 71, 144 
Dipole moment 25, 28, 144 
Dirac delta-function 326, 328, 355, 

366 
Discharge tubes 181-184 
Dispersion 

anomalous 30 

normal 30 

theory of 27-32 
Dominant wavelength 298, 306 
Donaldson colorimeter 310 
Doppler broadening 184, 189 
Doppler effect 241, 327, 333, 337 
Double refraction 99, 104, 126, 143, 

144 
Doublet lines 188, 225 
Doublet structure 189 
Dye-stuffs 239 



Eagle mounting 213 

Eastman subtractive colorimeter 311 



Echelle grating 216-221, 243 
Edser-Butler method for calibration 

of spectroscope 222-225 
Einstein 35, 38 
Einstein coefficients 87, 331 
Einstein's equation for photoelectric 

effect 36-37 
Electric charge 1-2, 34 
Electric displacement 2, 4, 27 
Electric field 8-10, 28, 79 

internuclear 82

wave equation for 8 
Electric field strength 2, 4, 5 
Electric field vector 92 
Electric moment 27-29 
Electric spark sources 186-188 
Electrodeless discharges 184 
Electromagnetic field 2-5 

and radiation 1-2 
Electromagnetic radiation, coherence 

of 325 
Electromagnetic theory 1-33 

of absorption of light 26-27 

of dielectrics 27 

of radiation 1-33 

of scattering of light 24-26 
Electromagnetic waves 5-10 

diffraction of 249 

in uniform isotropic dielectric 5-7 

plane 7-10 
Electron resonance frequency 30 
Electron rotation 38-39 
Electron spin 50, 75-76 
Electron states of atoms 49 
Electronic energies 80-81 
Electronic spectra 82 
Electronic states of molecules 82 
Electronic transitions in organic 

chemistry 239 
Electrons 25, 27 

charge of 367 

elliptical orbit of, within magnetic 
field 71 

extra-nuclear 43-45, 54 

in elliptical orbits 48-51 

magnetic moment of 51 

reduced mass of 46 

spectra of atoms due to transitions 
of two or three 57-59 

total moment of momentum 46 
Electrostatic field 1-2, 144 






Elliptical orbit 
angular velocity of precession of 

72-73 
electrons in 48-51 
of electron within magnetic field 
71 
Elliptical reflectors 342 
Elliptically polarised light. See Polar- 
ised light 
Emission spectra 179, 180, 188, 233 
Emulsion, photographic 191, 228-233, 

353-354 
Energy density 
of radiation 34-35 
variation with frequency 35 
Energy levels 
in helium-neon gas laser 343 
in ruby 340 
Energy-level diagram 41, 43, 56, 74- 

75 
Equal-energy spectrum 304 
Etalon, Fabry-Perot. See Fabry- 

Perot etalon 
Excitation energy 42, 86 
Extraordinary ray 99, 102, 105, 106 



Fabry 169, 170 

Fabry-Perot etalon and interferome- 
ter 70, 155-173, 175, 216, 217, 
333, 335 
comparison of standard metre with 

wavelength of light 169-171 
in series 166-169 
interference filters 171-173 
method of exact fractions 162 
with variable separation 164-166 

Fabry-Perot plane-parallel mirrors 
342 

Faraday effect 130-131, 132 

Far-field diffraction 268 

Fellgett 244 

Féry prism 235

Films, thin, interference in 285-287 

Filters 171-173, 239, 244 

Fine structure 47, 49, 50, 237 

Fizeau fringes 287-289 

Flame sources 180 

Flash photolysis 241 

Fluorescence 25, 191, 193 

Fourier analysis 190 



Fourier integral theorem 246 

Fourier mates 291, 363 

Fourier methods, Fraunhofer diffrac- 
tion treated by 289-291 

Fourier series, complex 361 

Fourier spectroscopy 244-246 

Fourier spectrum 362 

Fourier theorem 359 

Fourier transform 289, 291, 293, 294, 
328, 350, 351, 355, 357, 362-363, 
366 
examples of 364 

Fraunhofer diffraction 268, 282 
due to circular aperture 274-275 
due to narrow slit 269, 291-295 
due to rectangular aperture 271 
treated by Fourier methods 289-291 

Fraunhofer diffraction pattern 350- 
351 

Fraunhofer lines 179, 239 

Fresnel diffraction 250, 269 

Fresnel equations 19-23, 145-147 

Fresnel half-period zones 251-260 

Fresnel integrals 260 

Fresnel theory of natural rotation 
127-130 

Fresnel treatment of plane wavefront 
in circular aperture 250-254 

Fringes 
Fizeau 287-289 
general forms of 285 
Haidinger-type 286 
locus of 286 

Fused silica 217 



Gabor 354, 357 

Gallium arsenide laser 345 

Galvanometer amplifiers 195 

Gas analysers, infra-red 239 

Gaseous discharges 239 

Gaseous specimens 238 

Gases 

inert 181, 183 

smallest measurable concentration 
239 
Gaussian amplitude profile 329 
Gaussian distribution 329 
Gaussian function 364-366 
Gauss's theorem, Poisson's analyti- 
cal form 4 






Geissler tube 181 

Gerlach 52 

Germer 60 

Glan-Foucault polariser 102 

Globar source 189 

Glow discharge source 181 

Golay cell 191, 195-196, 236, 242, 

244 
Goudsmit 50
Gouy 281 
Grating 

blazed 208 

concave, mountings for 211-215 

diffraction 197, 198, 204, 206, 
284 

echelle 220, 243 

echelon 216-221 

reflection 206-208, 217-221 

Rowland concave 208-211 
Grating spectrometer 204-206 
Grating spectroscopy in far infra-red 
243 



Haidinger-type fringes 286 

Half-period zones 251-260 

Half-wave plate 110-111 

Harmonic oscillators 34 

Hartmann dispersion equation 222 

Heisenberg 60 

Heisenberg uncertainty principle 63, 
68 

Helium 47, 181, 183 

High-resolution spectrometry 215- 
217 

Hilger-Nutting spectrophotometer 
233-234 

Hologram 350, 351, 353, 354, 356, 
357 

Holography 350-357 
magnification 356-357 
reconstruction process 354-355 
recording process 351-354 

Hue 297 

Huygens' construction 104-105 

Huygens' principle 249 
Kirchhoff's formulation of 276

Huygens' secondary source 281 

Hydrogen atom 63 
Bohr theory of spectrum of 37- 
46 



energy-level diagram for 41 
excitation potentials of 43 
orbital picture of 43 
quantum mechanical theory of 64- 
68 
Hydrogen discharge lamp 183 
Hyperfine structure 161, 169, 184, 
215, 220 



Iceland Spar 99 
Illumination 

critical 313 

dark-ground 316, 320 

Köhler 313, 314

phase-contrast 316 
Image and data processing, laser 

350 
Incoherence 284, 325 

frequency, and amplitude 325 

spatial 325 

temporal 325-326 
Inert gases 181, 183 
Infra-red gas analysers 239 
Infra-red radiation sources 188-189 
Inner quantum number 50, 57, 59, 

66, 75
Intensity distribution in spectral line 

190 
Interference 

and associated phenomena 282 

effects in uniaxial crystals 138-143 

in thin films 285-287 

Lloyd's single-mirror experiment on 
326 
Interference filters 171-173, 183 
Interference phenomena, classifica- 
tion of 284 
Interferogram 245 
Interferometer 197 

Fabry-Perot. See Fabry-Perot 

Lummer-Gehrcke 151, 173-176 

Michelson 165, 169, 245 

Twyman and Green 347 

variable-length 216 
Interferometry, multiple-beam. See 

Multiple-beam interferometry 
International Commission of Illumination (C.I.E.) 302, 303, 306
Isochromatic line 143 
Isochromatic surface 143 






Isotopically pure material 190 
Ives colorimeter 309 



Jeans, Sir James 34 
jj coupling 59 
Johnson 322 



Kepler's laws 51 

Kerr cell 145 

Kerr constant 144-145 

Kerr electro-optical effect 144-145 

Kirchhoff 179 

Kirchhoff formula 278 

Kirchhoff formulation of Huygens'

principle 250, 276 
Kirchhoff theory of diffraction 276, 

279-281 
Koenig 298

Köhler illumination 313, 314
Krishnan 83 
KRS 5 material 204 
Krypton 169-171, 183, 221 



Laevorotatory substances 127, 132, 
137 

Lambert-Beer law 228 

Lambert's law 227 

Lamps, xenon 342 

Landé splitting factor 78

Landsberg 83 

Lasers 89, 325-350 
application of 347-350 
as optical oscillator 333-335 
communications applications 349 
Fabry-Perot type cavity 335-337 
gallium arsenide 345 
gas 337, 342-344 
helium-neon gas 333, 335, 342, 343, 

345 
image and data processing 350 
materials, properties and outputs 

of 347 
measuring techniques 347-349 
origin of term 332 
power applications of 349 
principle of 331-333 
Q-switched or pulsed 345-349 
ruby 340-342, 347, 348 



semiconductor or injection 344 
see also Optical masers 
Laurent half-shade plate 134 
Laurent polarimeter 134 
Lensless camera 350, 351 
Light 
absorption of, electromagnetic 

theory of 26-27 
amplification by stimulated emis- 
sion of radiation 331 
electromagnetic theory of 1, 63, 145 
polarisation. See Polarisation 
polarised. See Polarised light 
scattering of, electromagnetic 

theory of 24-26 
speed of 6-7, 367 

unpolarised 106, 111, 112, 113, 117, 
119 
Line spectra 180, 181 
Lippich polarimeter 133 
Lippich triple-field polariser 136 
Lithium 47, 181 
Littrow plane grating spectrograph 

208 
Littrow prism spectrometer 201-204 
Lloyd's single-mirror experiment on 

interference 325-326 
Longitudinal effect 68 
Lorenz-Lorentz theory 27 
LS coupling. See Russell-Saunders 

coupling 
Luminance 298 
Luminous emittance 298 
Lummer-Brodhun photometer 310 
Lummer-Gehrcke interferometer 151, 

173-176 
Lummer-Gehrcke plate 173, 216 
Lyman 37, 38, 192 
Lyman series 43 



Magnetic dipole moment 71 
Magnetic field 1-2, 9-10, 71, 73 
Magnetic field strength 4, 5 
Magnetic flux density 4, 5, 70, 130, 

131 
Magnetic moment 51, 71, 76, 77 
Magnetic permeability 5, 7, 12, 51 
Magnetic quantum numbers 50-53, 

77 
Magnetic rotation 130 






Magnification in holographic process 

356-357 
Malus 93 
Mandelstamm 83 
Maser 89, 90 

Maxwell-Boltzmann distribution 180 
Maxwell's equations 2-5 
Measuring techniques, laser 347-349 
Mercury arc lamp, quartz-enveloped 

243 
Method of exact fractions 162 
Method of minimum deviation 146 
Michelson 216 
Michelson interferometer 165, 169, 

245 
Microdensitometer 236 
Microphotometer 230, 236-237 
Microscope 312-324 

Abbe equation for limit of resolu- 
tion of 316 

Abbe theory of 312, 314 

dark-ground illumination 316, 320 

interference 321-322 

introduction 312 

limit of resolution 312 

phase-contrast 318-321 

phase-contrast illumination 316 

sub-stage condenser lenses 313 

ultra-violet 322 
Microscope objective 

numerical aperture of 315 

reflecting 322 
Microwaves, plane of polarisation of, 

rotation 131-132 
Mitscherlich polarimeter 133 
m.k.s. system of units 6-7 
Molar absorptivity 228 
Molar magnetic rotation 131 
Molar rotation 133 
Molecular energy levels 80 
Molecular spectra 37, 79-82 
Molecules 

diatomic 80-82 

electronic states of 82 

in gas or vapour 180 

in solid state 179 
Monochromatic radiation 245 
Monochromators 196-199, 203, 215, 
243 

constant-deviation 201 

definition 198 



Mountings for concave gratings 211- 

215 
Multiple-beam interferometry 151- 

178 
Multiplex advantage 246 
Multiplex method 245 



Narrow slit, Fraunhofer diffraction 

due to 269 
Neon 181, 183, 221 
Nernst filament 236 
Nernst glower 189 
Neumann's equation 2 
Neutron mass 25 
Nicol prism 102-104, 112, 115-121, 

123-125, 129, 132-138, 140, 141, 

145, 233, 234 
Nicol rhomb 113 
Normal dispersion 30 
Normal spectrum 198 
Nucleus 38, 45-46 
Numerical aperture 315, 323 



Obliquity factor 253 

Opacity 230 

Optic axis 99, 106 

Optical activity 126, 127, 132 

Optical density 230, 231, 236, 237, 
238 

Optical density-wavelength curve 
234 

Optical masers 325-350 
spectral line from 330 
see also Lasers 

Optical oscillator, laser as 333-335 

Optical path difference 152 

Optical pumping 89, 337-340, 342 

Optical welding 347 

Orbital angular momentum 71, 73 

Orbital magnetic moment 77 

Orbital magnetic quantum number 
52, 54, 73 

Orbital quantum number 48, 50, 53, 
57, 59, 71, 82 

Ordinary ray 99, 102, 106 

Organic chemistry, electronic transi- 
tions in 238 

Organic compounds 239, 241 






Orthogonality relationship 362 



Parrish 97 

Particles, wave nature of 59-61 

Paschen 37, 38 

Paschen hollow-cathode source 183 

Paschen mounting 211

Paschen series 43 

Pauli exclusion principle 53-54 

Penicillin solution 239 

Periodic method 246 

Periodic table 54, 55, 57, 58 

Permeability of free space 367 

Permittivity 2, 5, 12, 27, 93, 367 

Perot 169, 170 

Pfund series 43 

Phase change 23-24, 250 

Phase contrast 319-320 

Phase difference 139-141, 143, 153, 
160 

Phase object 316 

Phase plate 320-321 

Phase-contrast illumination 316 

Phase-contrast microscope 318-321 

Phosphorescence 191 

Photochemical products 241 

Photoelastic analysis 144 

Photoelectric detectors 192 

Photoelectric effect, Einstein's equa- 
tion for 36-37 

Photographic emulsions 191, 228-233, 
353-354 

Photographic plates, control 232 

Photographic recording, problems in 
232 

Photometer, Lummer-Brodhun 310 

Photomultiplier cell 203 

Photons 24, 36, 37, 63, 83, 86, 87 

Phototransducer 237 

Phototransistor 237 

Photovoltaic cell 237 

Pile of plates method 96 

Planck, Max 34, 35, 38 

Planckian distribution of radiation 
189 

Planck's constant 24, 35, 39, 63, 74, 
80, 81, 229, 327, 331, 357, 367 

Planck's equation 35, 87 

Plane of polarisation 
of microwaves 131-132 



rotation 126, 130, 145 
Plant pigments 239 
Poisson's analytical form of Gauss's 

theorem 4 
Polarimeter 111, 132 

sensitivity of 136 
Polarisation 27 
of light 92-105 
and double refraction 99 
by passage through wire grids 

97 
by reflection 93-96 
by scattering 96 
by selective absorption 93 
of reflected and refracted light 15 
plane of. See Plane of polarisation 
state of 21 

strain measurement by 143-144 
Polarised light 
analysis of 111-121 
circularly polarised 108, 111, 112, 

113, 117, 143

elliptically polarised 106-110, 112, 

114, 116-119, 124, 145 
linearly polarised 11-15, 92, 111, 

112, 113, 115, 117, 118, 119, 
145 

types of 111-112 
Polarising angle 146 
Polaroid 93, 96, 97, 112, 123, 124, 

130, 146, 147
Population 85, 87, 89 

inversion 89 
Power applications of lasers 349 
Pressure broadening 190 
Principal quantum number 39, 41, 48, 

49, 53, 54, 63, 67, 79 
Principal section 99 
Principle of detailed balance 88 
Principle of superposition 283 
Print-out mechanism 236 
Prism 197 

angular dispersion of 197 

chromatic resolving power of 197 

Cornu 202

Féry 235

Nicol. See Nicol prism 
Prism polarisers 100 
Prism testing with laser source 348 
Probability concept 64, 67 
Proton mass 25 






Q-switching 345 
Quantum mechanics 43, 60 
Quantum number, basic 53 
see also Inner quantum number; 
Magnetic quantum number; 
Principal quantum number; 
Spin magnetic quantum num- 
ber; Spin quantum number; 
Vibrational quantum number 
Quantum theory 24, 26, 59, 75, 84 
of radiation 34-91, 79 
origins of 34-35 
Quarter-wave plate 109, 112, 115-121, 

129 
Quartz 31-32, 99-101, 122, 126, 129, 

135, 137, 192, 235 
Quartz-fluorite achromatic lens 208 
Quartz plate 176, 134 
Quartz prism 203, 204, 214 
Quartz tubes 182, 184 
Quartz-enveloped mercury arc lamp 

243 
Quinine iodosulphate 93 



Radiation 

and electromagnetic field 1-2 

broad band sources 243 

broadening 190 

detectors and principles of detec- 
tion 190-196, 239, 242, 244 

electromagnetic theory 1-33 

energy density 34-35 

infra-red sources 189 

monochromatic 245 

Planckian distribution 189 

quantum theory 34-91 

spectrum of 37 

stimulated emission 85-90 
light amplification by 331 

ultra-violet 322-323 

unwanted 244 

wavelengths, frequency and photon 
energies 179 
Radio methods 243 
Radio waves 92 

Raman effect 25, 45, 82-85, 349 
Raman scattering 83-84 
Raman spectroscopy 349 
Raman spectrum 84-85 
Rare earths 189 



Rayleigh, Lord 34, 189 
Rayleigh criterion 159, 175, 197 
Rayleigh effect 83 
Rayleigh scattering 83-84 
Reciprocity failure 232 
Reciprocity law 231 
Rectangular aperture 

diffraction of cylindrical wavefront 
by 266-267 

Fraunhofer diffraction due to 271 
Rectangular obstacle, diffraction of 
cylindrical wavefront by 267-268 
Red shifts 241 

Reduced mass of electron 46 
Reference stimuli 302 
Reflection 

at boundary between two media 
11-15 

first law of 15 

of light at a boundary 15-24 

polarisation of light by 93-96 

second law of 15 
Reflection coefficients 20-21, 23, 152, 

154, 155, 160, 172, 335, 337 
Reflection echelon grating 217-221 
Reflection filters 172 
Reflection grating 206-208, 217-221 
Reflectors, elliptical 342 
Refraction 

at boundary between two media 
11-15 

double 99, 106, 126, 143, 144 

first law of 15 

in uniaxial crystals 104 

of light at a boundary 15-24 

second law of 15 
Refractive index 11, 15, 17, 20, 23, 27, 
94, 105, 106, 139, 146, 157, 172, 
174, 199, 200, 218, 219, 224, 312 

function of wavelength 174 

variation with frequency 29, 30 

variation with wavelength 30, 32 
Relativity, equation of special theory 

of 49 
Relativity correction 49 
Resolving power 160-161, 173, 176, 

219 
Resonance 27 
Robens 244 
Rochon polariser 101 
Rotational energies 80-81 






Rotational spectral bands 80-81 
Rotational-vibrational spectral bands 

80-81 
Rotatory dispersion 127, 129, 133 
Rowland circle 209, 211, 213, 214 
Rowland concave grating 208-211 
Rowland mounting 212, 214 
Ruby, energy levels in 340 
Ruby laser 340-342, 347, 348 
Russell-Saunders coupling 58, 59, 82 
Rutherford 38 
Rydberg 37 
Rydberg constant 41, 46, 47 



Saccharimeter 137 
Saccharimetry 132 
Saturation 297, 298 
Scattering 
of light 95 

electromagnetic theory of 24-26 
polarisation of light by 96 
types of 82-83 
Schawlow 333 
Schrödinger 60

Schrödinger's equation 61-62, 64
Schumann photographic plates 192 
Schwarz linear thermopile 193, 203 
Schwarzschild exponent 232
Screens, complementary, Babinet 

principle of 275 
Sears 169 

Sellmeier's formula 30, 31 
Semiconductor devices 192 
Semiconductor materials 193, 344-345 
Sensitometric curve 230 
Servomechanism 236 
Shells 54, 55 
Slit, narrow, Fraunhofer diffraction 

due to 291-295 
Snell's law 15, 18, 22 
Sodium 56, 78, 103, 181, 183, 188, 

189 
Solar spectrum 179, 239 
Soleil compensator 125-126, 144 
Solution, concentration of 238 
Sommerfeld 48, 49 
Space quantisation 52, 73, 77, 82 
Spark. See Electric spark 
Spatial amplitude distribution 282, 

290, 293 



Spatial incoherence 325 
Specific rotation 129, 132 
Specific rotatory power 132 
Spectra 

absorption 80 

electronic 82 

molecular 79-82 

of alkali metals 50, 55-56 

of atoms due to transitions of two 
or three electrons 57-59 

of deuterium and of ionised atoms 
having single orbital electron 
47 

theory of, development of 37-45
Spectral bands 80-81 
Spectral distribution 245, 308 
Spectral frequency spread 329 
Spectral lamps 182 
Spectral line 

from optical maser 330 

intensity distribution in 190 
Spectral line amplitude distribution 

328 
Spectral line broadening 189-190, 

327 
Spectral line width 326-329, 335-337 
Spectral notation 

of molecules 82 

of orbiting electrons in atoms 49 
Spectral reflectance 227 
Spectrograph 161 

calibration of 222 

definition 198 

Littrow plane grating 208 
Spectrography, photographic emul- 
sions 228-233 
Spectrometer 146, 196-197 

calibration of 222 

chromatic resolving power of 225 

constant-deviation 199-201 

definition 198 

grating 204-206 

Littrow prism 201-204 
Spectrometry, high-resolution 215- 

217 
Spectrophotometers 225, 234-236 

absorption 227, 234, 238-241 

double-beam diffraction grating re- 
cording 235 

Hilger-Nutting 233-234 
Spectrophotometry 225-228 



absorption 227 

applications of 238-241 
in near infra-red 236 
photographic emulsions 228-233 
photographic plates, control 232 
Spectroscope, calibration of, Edser- 

Butler method 222-225 
Spectroscopy 179-248 
astronomical applications 241-242 
Fourier 244-246 

fundamental relation of 246 
grating, in far infra-red 243 
in far infra-red 242 
Raman 84, 349 
Spectrum line, fine structure of 47, 

49, 50
Spectrum locus 304, 306 
Spectrum of hydrogen atom, Bohr 

theory of 37-43, 45-46 
Spectrum of radiation 37 
Speed of light 6-7 
Spherical aberration 323 
Spherical wavefront in circular aper- 
ture 255-257 
Kirchhoff's theory of diffraction of
279-281 
Spin magnetic quantum number 52, 

54
Spin quantum number 50, 57, 59, 82 
Spontaneous emission time 86 
Standards of wavelength 221 
Stark effect 45, 79, 242 
Stern 52 

Stimulated emission 
four-energy-level system 339 
in pulsed lasers 345 
in semiconductors 345 
of radiation 85-90 

light amplification by 331 
three-energy-level system 338 
Stokes lines 84-85 

Straight edge, diffraction of cylindri- 
cal wavefront by 265 
Strain measurement by polarisation 

methods 143-144 
Stress distribution 144 
Structure, chemical 238 
Submillimetre microwave methods 

243 
Sub-shells 54, 55 



Sub-stage condenser 313 
Sugar solutions 132, 137 
Sun's chromosphere 239 
Superposition, principle of 283 



Temporal incoherence 325-326 
Thermal detectors 193 
Thermistors 191 
Thermojunction 191, 193 
Thermopile 191, 193-195, 203 
Thin films, interference in 285-287 
Thomson, G. P. 60 
Titanium dioxide 172 
'Top-hat' function 293, 294, 365 
'Top-hat' waveform 328, 329 
Townes 333 

Transmission coefficient 152, 154-155 
Transmission echelon 217 
Transmittance 230 

recording 236 
Transmittance-wavelength curve 234, 

308 
Transverse effect 68 
Trichromatic coefficients 304-306 
Tristimulus value 300, 301, 304, 305 
Twyman and Green interferometer 

347 



Uhlenbeck 50 

Ultra-violet microscope 322-323 

Uniaxial crystals, interference effects 

in 138-143 
Unicam prism absorptiometer 235 
Units, m.k.s. system 6-7 
Vacuum spectrographs 201, 214 
Velocity of electromagnetic radiation

and refractive index 11 
Velocity of light in free space 367 
Verdet's constant 130 
Vibrational energies 80-81 
Vibrational quantum number 81 
Visibility technique 245 



Wadsworth mounting 214-215 

Wave equation 8, 65 

Wave function, modulus of complex 

amplitude of 64 
Wave mechanics 60, 62 






Wave motion 63, 64 
Wave number 37, 43, 47, 79, 243 
Wavefront 
calculating shape of 104-105 
cylindrical 260 
diffraction of, by straight edge 
265 
division of 284, 285 
plane, in circular aperture, Fresnel's 

treatment of 250-254 
spherical, in circular aperture 255- 
257 
Kirchhoff's theory of diffraction
of 279-281 
Wavelengths 
characteristic absorption 241 
dominant 298, 306 
standards of 221 
Welding, optical 347 
White light 299, 304, 305 
Wilson 48, 49 



Wire grid 97-98 
Wollaston polariser 100 
Wood, R. W. 84 
Wright colorimeter 309 



Xenon lamps 342 
X-radiation 31 
Xylonite 143 



Young's double slit arrangement 356 
Yttrium oxide 189 



Zeeman effect 45, 50, 52, 68-79, 161, 
220, 242 

anomalous 70, 75-79 

normal 70-75 
Zernike 318 
Zone plate 257-260 



You should also read

UNIVERSITY OPTICS— Volume 1 

D. W. Tenquist, MSc, R. M. Whittle, BSc, PhD, and J. Yarwood,

MSc, FlnstP 

8¾" × 5⅝" (222 × 143 mm) 350 pages 166 illustrations

70s. net (£3.50)

592 05054 8 

CONTENTS 

Lenses and Systems of Lenses. Aberrations of Optical Components 
and Systems. Optical Instruments. Refractometry. The Velocity of
Light. Photometry. Wave Theory of Light. Interference Between Two
Beams of Light Involving Division of Wavefront. Interference Between 
Two Beams of Light: Division of Amplitude by Parallel Plate or Film. 
Interference Between Two Beams of Light: Division of Amplitude by 
Semi-Transparent Reflector. Diffraction of Light: Introductory Theory 
and Practice of Fraunhofer Diffraction. Resolving Power. Answers to 
Numerical Exercises. 

PHYSICS— Volume 1 

Z. KAMINSKI. English translation edited by R. A. Giles, BSc, PhD 

8¾" × 5⅝" (222 × 143 mm) 354 pages 303 illustrations
50s. net (£2.50)
592 05045 9 cased
35s. net (£1.75)

592 05072 6 limp 

CONTENTS 

Measurement. Statics. Kinematics. Dynamics. The Mechanical Proper-
ties of Liquids and Gases. Heat. The Perfect Gas and the Gas Laws.
The Principles of Thermodynamics. The Atomic Theory of the Structure
of Matter. Elasticity of Bodies. Vibrations and Waves. Acoustics.
Optics.



PHYSICS— Volume 2 

Z. Kaminski and R. A. Giles, BSc, PhD

8¾" × 5⅝" (222 × 143 mm) 228 pages 175 illustrations

45s. net (£2.25)

592 05060 2 cased

30s. net (£1.50)

592 05073 4 limp 

CONTENTS 

Electrostatics. Electric Current. The Magnetic Field. Electromagnetic
Induction. Electromagnetic Oscillations and Electromagnetic Waves.
Electrical Phenomena in Gases and Vacuum. Electrical Conduction in
Solids. The Structure of the Atom. Radioactivity. The Special Theory of
Relativity. Nuclear Energy.

THE BUTTERWORTH GROUP 88 Kingsway, London, W.C.2 



